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1 CGGGCAGCAA AGGAGGATGG CGAGGGGCTG ATACTGAACC CGGGAAGGGT 
51 GGGCTGTGCT GAAGCTAGAG CCGGAGCCGG AGCTGGGGCC AGAACCCGAG 
101 CACTGCCATG TCCACGCAGA GACTTCGGAA TGAAGACTAC CACGACTACA 
151 GCTCCACGGA CGTGAGCCCT GAGGAGAGCC CGTCGGAAGG CCTCAACAAC 
201 CTCTCCTCCC CGGGCTCCTA CCAGCGCTTT GGTCAAAGCA ATAGCACAAC 
251 ATGGTTCCAG ACCTTGATCC ACCTGTTAAA AGGCAACATT GGCACAGGAC 
301 TCCTGGGACT CCCTCTGGCG GTGAAAAATG CAGGCATCGT GATGGGTCCC 
351 ATCAGCCTGC TGATCATAGG CATCGTGGCC GTGCACTGCA TGGGTATCCT 
401 GGTGAAATGT GCTCACCACT TCTGCCGCAG GCTGAATAAA TCCTTTGTGG 
451 ATTAT6GTGA TACTGTGATG TATGGACTAG AATCCAGCCC CTGCTCCTGG 
501 CTCCGGAACC ACGCACACTG GGGAAGACGT GTTG TG GACT TCTTCCTGAT 
551 TGTCACCCAG CTGGGATTCT GCTGTGTCTA TTTTGTGTTT CTGGCTGACA 
601 ACTTTAAACA GGTGATAGAA GCGGCCAATG GGACCACCAA TAACTGCCAC 
651 AACAAT6AGA CGGTGATTCT GACGCCTACC ATGGACTCGC GACTCTACAT 
701 GCTCTCCTTC CTGCCCTTCC TGGTGCTGCT GG I I I I CATC AGGAACCTCC 
751 GAGCCCTGTC CATCTTCTCC CTGTTGGCCA ACATCACTAT GCTGGTCAGC 
801 TTGGTCATGA TCTACCAGTT CATTGTTCAG AGGATCCCAG ACCCCA6CCA 
851 CCTCCCCTTG GTGGCCCCTT GGAAGACCTA CCCTCTCTTC TTTGGCACAG 
901 CG A I I I I I IC ATTTGAAGGC ATTGGAATGG TTCTGCCCCT GGAAAACAAA 
951 ATGAAGGATC CTCGGAAGTT CCCACTCATC CTGTACCTGG GCATGGTCAT 
1001 CGTCACCATC CTCTACATCA GCCTGGGGTG TCTGGGGTAC CTGCAATTTG 
1051 GAGCTAATAT CCAAGGCAGC ATAACCCTCA ACCTG CCCAA CTGCTGGTTG 
1101 TACCAGTCAG TTAAGCTGCT GTACTCCATC GGGATCTTTT TCACCTACGC 
1151 ACTCCAGTTC TACGTCCCGG CTGAGATCAT CATCCCCTTC TTTGTGTCCC 
1201 GAGCGCCCGA GCACTGTGAG TTAGTGGTGG ACCTGTTTGT GCGCACAGTG 
1251 CTGGTCTGCC TGACATGCAT CTTGGCCATC CTCATCCCCC GCCTGGACCT 
1301 GGTCATCTCC CTGGTGGGCT CCGTGAGCAG CAGCGCCCTG GCCCTCATCA 
1351 TCCCACCGCT CCTGGAGGTC ACCACCTTCT ACTCAGAGGG CATGAGCCCC 
1401 CTCACCATCT TTAAGGACGC CCTGATCAGC ATCCTGGGCT TCGTGGGCTT 
1451 TGTGGTGGGG ACCTATGAGG CTCTCTATGA GCTGATCCAG CCAAGCAATG 
1501 CTCCCATCTT CATCAATTCC ACCTGTGCCT TCATATAGGG ATCTGGGTTC 
1551 GTCTCTGCAG CTGCCTACCC CTGCCCCATG TGTCCCCCGT TACCTGTCCT 
1601 CAGAGCCTCA GGTATGGTCC AGGCTCTGAG GAAAGTCAGG GTTGCTGTGT 
1651 GGGAACCCCT CTGCCTGGCA CCTGGATACC CTGGGCCAGG TAACCTGAGG 
1701 GCAGGGGAGA GGTGGGGTGG CAGACACGCA GAAGTGCTAC TAGTGACAGG 
1751 GCTGCCATCG CTCACCTGTA CCTATTTACA CCCAGAACTT TCCAGCTCCC 
1801 CCTCATCATG CCTCCTCCTT CCTACCTGCC TCCCCTCTGC TG GTGC ACCT 
1851 CGCCCAACTC ATTCTTACTG CACAGTTCAC TTTATTTAAC AATTTTCATG 
1901 TCCCCCATCT CGCTCTGTGC CCCTCCCCAC CAGGGCTTCA GCAGGAGCCC 
1951 TGGACrCATC ATCAATAAAC ACTGTTACAG CAAAAAAAAA AAAAAAAAAA 
2001 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 
2051 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAA (SEQ ID N0:1) 



features: 

S'UTR: 1-107 

start Codon: 108 

Stop codon: 1536 

3'UTR: 1539 
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HOMOLOGOUS PROTEINS: 
TOD BLAST Hits: 



CRA 
CRA 
CRA 
CRA 
CRA 
CRA 
CRA 
CRA 
CRA 
CRA 



89000000199482 
89000000199480 
89000000199481 
89000000197173 
89000000195851 
18000005127815 
89000000194855 
89000000197171 
89000000197172 
18000005102492 



/altid=gi 
/alticl=gi 
/al tid=gi 
/altid=gi 
/al ti d=gi 
/a1tid=gi 
/al ti d=gi 
/a1tid=gi 
/a1tid=gi 
/al ti d=gi 



7297404 
7297402 
7297403 
7294781 
7293314 
7509795 
7292192 
7294779 
7294780 
2429516 



/def=gb 
/def=gb 
/def=gb 
/def=gb 
/def=gb 



AAF52663 
AAF52661 
AAF52662 
AAF50116 
AAF48694 



/def=pir|lT26845 
/def=gb|AAF47603 
/def=gb|AAF50114 
/def=gb|AAF50115 
/def-gb|AAB71045 



11 CAE003. 
II (AE003. 
II CAE003. 
,11 (AE003. 
,11 CAE003. 
hypotheti . 



CAE003, 
(AE003. 
(AE003. 
(AF025. 



BLAST dbEST hits: 

gi 1 5422591 /dataset-dbest /taxon=9606 . . . 
gi 1 3648072 /dataset^dbest /taxon=9606 . . . 
gi 1 5746200 /dataset=dbest /taxon=9606 . . . 
gi 1 10249244 /dataset==dbest /taxon=96... 
gi 1 8612353 /dataset^dbest /taxon=960. . . 
gi 1 10083945 /dataset=dbest /taxon=960. . . 

EXPRESSION INFORMATION FOR MODULATORY USE: 
library source: 
From BLAST dbEST hits; 



91 
91 
91 
91 
91 



5422591 testis 
3648072 Testis 
5746200 Brain meningiomas 
10249244 Brain normal 
8612353 Head-neck 
10083945 colon 



Score 
330 
330 
330 
268 
265 
258 
253 
252 
252 
250 



Score 
1400 
1017 
730 
642 
329 
313 



E 

4e-89 
4e-89 
4e-89 
le-70 
le-69 
2e-67 
5e-66 
8e-66 
8e-66 
3e-65 



E 

0.0 

0.0 

0.0 

0.0 

le-87 

7e-83 



From tissue screening panels: 
Human whole liver 
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1 m«;T0RL RNED YHDYSSTDVS PEESPSE6LN NLSSPGSYQR FGQSNSTTWF 

51 StIiSLlkgS Igtgllglpl avknagivmg pislliigiv avhcmgilvk 

im rAHHFCRRLN KSFVDYGDW MYGLESSPCS WLRNHAHWGR RWDFFLIVT 
151 Sfc^C™ FLADNFKQVI EAANGTTNNC HNNETVILTP TMDSRLYMLS 
201 FLPFLVLLVF IRNLRALSIF SLLANITMLV SLVMIYQFIV QRIPDPSHLP 
7^1 r VAPWKTYPL FFGTAIFSFE GIGMVLPLEN KMKDPRKFPL ILYLGMVIVT 
301 ILYK^^^G y[qFGANIQG SITLNLPNCW LYQSVKLLYS IGIFFTYALQ 
351 FYVPAEIIIP FFVSRAPEHC ELWDLFVRT VLVCLTCILA ILIPRLDLVI 
40i sSsSSA LALIIPPLLE VTTFYSEGMS PLTIFKDALI SILGFVGFW 
451 GTYEALYELI QPSNAPIFIN STCAFI (SEQ ID NO. 2) 



Functional domains and key regions: 

[1] PDOCOOOOl PSOOOOl ASN^GLYCOSYLATION 

N-glycosylation site 

Number of matches: 7 

1 31-34 NLSS 

2 45-48 NSTT 

3 110-113 NKSF 

4 174-177 NGTT 

5 183-186 NETV 

6 225-228 NITM 

7 470-473 NSTC 

[21 PDOC00005 PS00005 PKC^PHOSPHO_SITE 
protein kinase c phosphorylation site 

Number of matches: 2 

1 3-5 TQR 

2 334-336 SVK 

[3] PDOC00006 PS00006 CK2_PH0SPH0^SITE 
casein kinase II phosphorylation site 

Number of matches: 4 

1 15-18 5STD 

2 20-23 SPEE 

3 24-27 SPSE 

4 112-115 SFVD 

[4] PDOC00007 PS00007 TYR_PHOSPHO«SITE 
Tyrosine kinase phosphorylation site 

7-14 RNEDYHDY 
[5] PDOC00008 PS00008 MYRISTYL 

N-myristoylation site 

Number of matches: 7 

1 42-47 GQSNST 

2 59-64 GNIGTG 

3 67-72 GLPLAV 

4 175-180 GTTNNC 

5 342-347 GIFFTY 

6 404-409 GSVSSS 

7 451-456 GTYEAL 

[6] PDOC00009 PS00009 AMIDATION 
Amidation site 

138-141 WGRR 

MPmhrane spanning structure and domains; 
— Helix Begin EnB Score certainty 

1 52 72 0.668 Putative 

2 75 95 2.032 Certain 

3 143 163 1.799 Certain 
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4 


193 


213 


5 


216 


236 


6 


258 


278 


7 


289 


309 


8 


335 


355 


9 


375 


395 


10 


398 


418 


11 


437 


457 



1.467 Certain 

1.884 Certain 

1.566 certain 

2.126 Certain 

1.378 certain 

1.332 certain 

1.748 certain 

1.533 certain 



FIGURE 2B 



Docket No.: CL001062CON 
Serial No.: TO BE ASSIGNED 
Inventors: WEI, Ming-Hui et al. 
Title: ISOLATED HUMAN TRANSPORTER PROTEINS... 



>^T89O0§000M^ /def=gblAAF52663 II (AE003621) 

' CG13384 gene product [alt 3] [Drosophila melanogaster] 
/org=Drosophila melanogaster /taxon=7227 /dataset=nraa 
/lenqth-486 
Length = 486 

?S2!;?Uiel'2 $j5^J"a3«::'''pSsUi$« = 252/425 CeW. caps = 32/425 (7%) 
Query: 47 ^FQTL^GNigrc^^^^^^^ 106 

Sbjct: 78 TSNFDTLVHLLKGNIGTGILAMPDAFKNAGLYVGLFGTMIMGAICTHCMHMLVNCSHELC 137 

Query: 107 rrlnksfvdygdtvmyglesspcswlrnhahwgrrwdfflivtqlgfccw 166 

RR + +D+ + ES P LR ++ RR+V FL +TQ+GFCCVYF+F+A N 

Sbjct: 138 rrfqqpsldfsevaycsfesgplg-lrrysmlarrivttflfitqigfccvyflfvalni 196 
Query: 167 kqvieaangttnnchnnetviltptmdsrlymlsflpflvllvfirnlralsifsli^ni 226 

K V++ H + M ++Y+L L ++LL +RNL+L+ SL+A + 

sbjct: 197 KDVMD HYYK MPVQIYLLIMLGPMILLNLVRNLKYLTPVSLVAAL 240 

Query- 227 tmlvslvmiyqfivqripdpshlplvapwktyplffgtaifsfegigmvlplenkmkdpr 286 

query, lu mLv:>LV|v,xTi^^y + VA W T PL+FGTAI++FEGIG+VLPLEN M+ P 

Sbjct: 241 LTVAGLAITFSYMLVDLPDVHTVKPVATWATLPLYFGTAIYAFEGIGWLPLENNMRTPE 300 

Query: 287 kf— PLILYLGMVIVTILYISLGCLGYLQFGANIQGSITLNLP-NCWLYQSVKLLYSIG 342 
query, -to/ Nr ^^^^ GYL++6 +++GSITLNLP L Q V++ ++ 

Sbjct: 301 DFGGTTGVLNTGMVIVACLYTAVGFFGYLKYGEHVEGSITLNLPQGDTLSQLVRISMAVA 360 

Query: 343 IFFTYALQFYVPAEIIIPFF VSRAPEHCELWDLFVRTVLVCLTCILAILIPRLD 397 

IF +Y LQFYVP 1+ PF +RA + V +R VLV T +LA IP L 
sbjct: 361 IFLSYTLQFYVPVNIVEPFVRSHFDTTRAKDLSATV LRWLVTFTFLLATCIPNLG 415 

Query: 398 LVISLVGSVSSSALALIIPPLLE\m-FYSEGMSPLT--IFKDALISILGFVGFWGTY^ 455 

+ISLVG+VSSSALALI PP++EV TFY+ G ++KD LI I G 6FV GT+ + 

Sbjct: 417 SIISLVG.AVSS5ALAITAPPIIEVITFYNVGYGRFNWMLWKDVLILIF6LCGFVFGTWAS 476 

Query: 456 LYELI 460 
L +++ 

Sbjct: 477 LAQIL 481 (SEQ ID N0:4) 

>CRA 1 89000000199481 /a1tid=gi 1 7297403 /def=gbiAAF52662 1 1 CAE003621) 
CG13384 gene product [alt 2] [Drosophila melanogaster] 
/org=Drosophila melanogaster /taxon=7227 /dataset=nraa 
/lenqth=482 
Length = 482 

score = 330 bits (837). Expect _= .2e-89 ,75.. 
Identities = 184/425 (43%), Positives = 262/425 (6156), Gaps = 32/425 

Query: 47 ttvifqtlihllkgnigtgllglplavknagivmgpislliigivavhoigilw 106 

T+ F TL+HLLKGNIGTG+L +P A KNAG+ +G +I+G + HCM +LV C+H C 

Sbjct: 74 tsnfdtlvhllkgnigtgilampdafknaglyvglfgtmimgaicthcmhmlvncshelc 133 

Query: 107 RRLNKSFVDYGDT>MYGLESSPCSWLRNHAHWGRRWDFFLINn;QLGFCCW 166 
RR + +D+ + ES P LR ++ RR+V FL +TQ+GFCCVYF+F+A N 

Sbjct: 134 rrfqqpsldfsevaycsfesgplg-lrrysmlarrivttflfitqigfccvyflfvalni 192 
Query: 167 kqvieaangttnnchnnetviltptmdsrlymlsflpflvllvfirnlralsifsllani 226 

K V++ H + M ++Y+L L ++LL +RNL+ L+ SL+A + 

Sbjct: 193 KDVMD HYYK MPVQIYLLIMLGPMILLNLVRNLKYLTPVSLVAAL 236 

Query: 227 tmlvslvmiyqfivqripdpshlplvapwktyplffgtaifsfegigiwlplenkmkdpr 286 

query, cat x^x.^ ^ ^ + +++ +PD + VA W T PL+F6TAI++FEGIG+VLPLEN M+ P 
Sbjct: 237 LTVAGLAITFSYMLVDLPDVHTVKPVATWATLPLYFGTAIYAFEGIGWLPLENNMRTPE 295 
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Query: 287 KF— PLILYLGMVIVTILYISLGCLGYLQFGANIQGSITLNLP-NCWLYQSVKLLYSIG 342 

F +L GMVIV LY ++G GYL++G +++GSITLNLP L Q V++ ++ 
sbjct: 297 DFGGTTGVLNTGMVIVACLYTAVGFFGYLKYGEHVEGSITLNLPQGDTLSQLVRISMAVA 356 

Query: 343 IFFTYALQFYVPAEIIIPFF VSRAPEHCELWDLFVRTVLVCLTCILAILIPRLD 397 

IF +Y LQFYVP 1+ PF +RA + V +R VLV T +LA IP L 
Sbjct: 357 IFLSYTLQFYVPVNIVEPFVRSHFDTTRAKDLSATV LRWLVTFTFLLATCIPNLG 412 

Query: 398 lvislvgsvsssalaliippllevttfysegmsplt— ifkdalisilgfvgfwgtyea 455 

+ISLVG+VSSSALALI PP++EV TFY+ G -H-KD LI I G GFV GT+ + 

Sbjct: 413 SIISLV6AVSSSALALIAPPIIEVITFYNVGYGRFNWMLWKDVLILIFGLCGFVFGTWAS 472 

Query: 456 lyeli 460 
L +++ 

sbjct: 473 LAQIL 477 (SEQ ID NO: 5) 

>CRA| 18000005127815 /altid=gi | 7509795 /def=pi r | IT26845 hypothetical 
protein Y43F4B.7 - Caenorhabditis elegans 
/org=caenorhabditis elegans /taxon=6239 /dataset^nraa 
/lenath-607 
Length = 607 

score = 258 bits (652), Expect = le-67 

Identities = 142/418 (33%), Positives = 235/418 (55%). Gaps = 19/418 (4%) 
Query: 40 rfgqsnsttofqtlihllkgnigtgllglplavknagivmgpislliigivavhcmgilv 99 

R NS T Q IH++K +GTGLL LPLA K++G+ +G I ++I ++ -H-CM +V 
Sbjct: 42 RLPTENSLTPEQAFIHMVKAMLGTGLLSLPLAFKHSGLFLGLILTVLICLICLYCMRQW 101 

Query: 100 kcahhfcrrlnksfvdygdtvmyglesspcswlrnhahwgrrwdfflivtqlgfccvyf 159 

AH C R + +DY + + +E P W++ + ++ +++V+ + -H-QLGFCCVYF 
Sbjct: 102 FAAHFVCNRNGRDLIDYANIMRGAVEMGP-PWIKRNGYFFKQLVNVNMFISQLGFCCVYF 160 

Query: 160 vfladnfkqvieaangttnnchnnetviltptmdsrlymlsflpflvllvfirnlralsi 219 

VF+ADN + NN T I + ++ML L ++ + iR L L+ 

sbjct: 161 VFMADNLEDFF NNNTSI— HLSKAVWMLLLLIPMLSICSIRRLSILAP 206 

Query: 220 fsllanitmlvslvmiyqfivqripdpshlplvapwktyplffgtaifsfegigmvlple 279 

F++ AN+ +V++ ++ F + + S LP PLFFGT +F+FEG+ +++P+E 

sbjct: 207 famaanwywavawlffflsdlrpisslpwfgkatdlplffgtvmfafegvavimpie 266 
Query: 280 nkmkdprkfpl— ilylgmvivtilyislgclgylqfganiqgsitlnlpncwlyqsvk 336 

N+M+ P F +L ++V ++ G GYL G +++ + TLNLP YQ++K 

sbjct: 267 nrmqsphafiswngvlnssclwlaifsvtgfygylslgndvkdtatlnlpmtpfyqtik 326 
Query: 337 llysigifftyalqfyvpaeiiipffvsrapehcelwdlfvrtvlvcltcilailiprl 396 

L++ I +Y LQFYVP EI + +P ++ R V LTC +A LIP L 

sbjct: 327 lmfvacimisyplqfyvpmeriekwitrkipvdkqtlyiyiarysgviltcaiaeliphl 386 
Query: 397 dlvislvgsvsssalaliippllevttfyseg-mspltifkdalisilgfvgfwgty 453 

L ISL+G+ S +++AL+ PP +E+ T Y++ +S K+ ++ F+GF GTY 

sbjct: 387 ALFISLIGAFSGASMALLFPPCIELLTSYAKNELSTGLWIKNIVLLTFAFIGFTTGTY 444 (SEQ 
ID NO: 6) 

>CRA| 335001101719045 /dataset=FastAlert /length=476 
/a1 ti d=De rwen 1 1 WO200071709 . 21 
Length = 476 
score = 909 bits (2324). Expect =0.0 
Identities = 450/476 (94%). Positives = 459/476 (95%) 
Frame = +3 

Query: 108 MSTQRLRNEDYHDYSSTDVSPEESPSEGLNNLSSPGSYQRFGQSNSTTWFQTLIHLLKGN 287 

MSTQRLRNEDYHDYSSTDVSPEESPSEGLNNLSSPGSYQRFGQSNSTTV/FQTLIHLLKGN 
sbjct: 1 MSTQRLRNEDYHDYSSTDVSPEESPSEGLNNLSSPGSYQRFGQSNSTTWFQTLIHLLKGN 60 
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Query: 288 igtgllglplavknagivmgpislliigivavhcmgilvkcahhfcrrlnksfvdygdtv 467 

IGTGLLGLPLAVKNAGIVMGPISLLIIGIVAVHCMGILVKCAHHFCRRLNKSFVDYGDTV 

sbjct: 61 igtgllglplavknagivmgpislliigivavhcmgilvkcahhfcrrlnksfvdygdtv 120 

Query: 468 myglesspcswlrnhahwgrrwdfflivtqlgfccvyfvfladnfkqvieaangttnnc 647 

myglesspcswlrnhahwgrrwdfflivtqlgfccvyfvfladnfkqvieaangttnnc 
sbjct: 121 myglesspcswlrnhahwgrrwdfflivtqlgfccvyfvfladnfkqvieaangttnnc 180 

Query: 648 HNNETVILTPTMDSRLYMLSFLPFLVLLVFIRNLRALSIFSLLANITMLVSLVMIYQFIV 827 

hnnetviltptmdsrlymlsflpflvllvfirnlralsifsllanitmlvslvmiyqfiv 
sbjct: 181 hnnetviltptmdsrlymlsflpflvllvfirnlralsifsllanitmlvslvmiyqfiv 240 

Query: 828 qripdpshlplvapwktyplffgtaifsfegigmvlplenkmkdprkfplilylgmvivt 1007 

qripdpshlplvapwktyplffgtaifsfegigmvlplenkmkdprkfplilylgmvivt 
Sbjct: 241 qripdpshlplvapwktyplffgtaifsfegigmvlplenkmkdprkfplilylgmvivt 300 

Query: 1008 ilyislgclgylqfganiqgsitlnlpncwlyqsvkllysigifftyalqfyvpaeiiip 1187 

ilyislgclgylqfganiqgsitlnlpncwlyqsv+lly gi ty lqfyv a+ii+p 
sbjct: 301 ilyislgclgylqfganiqgsitlnlpncwlyqsvellylggicltyplqfyvsakiivp 360 

Query: 1188 ffvsrapehcelwdlfvrtvlvcltcilailiprldlvislvgsvsssalaliipplle 1367 

VS + C L+VDL + + ++C tcilailiprldlvislvgsvsssalaliipplle 
Sbjct: 361 vivswvckcctlmvdlgigsamlcktcilailiprldlvislvgsvsssaukliipplle 420 

Query: 1368 vttfysegmspltifkdalisilgfvgfwgtyealyeliqpsnapifinstcafi 1535 

vttfysegmspltifkdalisilgfvgfwgtyealyeliqpsnapifinstcafi 
sbjct: 421 vttfysegmspltifkdalisilgfvgfwgtyealyeliqpsnapifinstcafi 476 (seq 
ID NO: 7) 



Hmmer search results (Pfam): 

Model Description ^ . Score E-y^1ue N 

PF01490 Transmembrane amino acid transporter protein 223.3 ^ -56-03 1 
PF01091 PTN/MK heparin-binding protein family 2.0 9.5 1 

Parsed for domains: 

Model Domain seo-f seo-t hmm-f h mm-t score ^-value 

PF01091 iTl 192 208 .. 1 TTT: 2.0 9.5 

PF01490 1/1 71 451 .. 1 467 [] 223.3 3.5e-63 
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1 AAAACCAGAA AGTCAGATAG TCCCTGTCTC ATCTTCAATC TOTTATTTG 
51 TTATTAGTCT GTTCAGGCTT TCCATTTCTT CCT^TTrCA ATCTTGGTAG 
101 GTTGTATTTT TCTAGGGATT TCTCCATTTC ATCTG GGTTA TCCAATATGT 
151 TGGCAAATAA TTGTTCACAA TAGCCCCATA TGATTCTTTr TATTTCTGAA 
201 GCTTCTGTTG TAGTGTCTTC ACTTTCATTT TTTATTTTAT TA6TCTTCTT 
251 TTTTTTCTTA GACTA6CAAA GGGTACGTCA ATTTAATT TT TTCCAAAAAA 
301 TCAACrCTAG TTTTATTGAT TTT GTCTG TT I ICTG ^IGTCTAITT 
^51 GATTTATTTC TGTTCTGCTC TTTCTTTTCT TCI I I lAACT TTGGGTCTTG 
401 ^^Vrri ™CCAGT TTCCTGAGGT ATAATGT7AA ACTATTTA^T 
451 AGGTCTCTTT CTTCTCTTTT AATGTAGGCA TTTATGTCTA TAAACTTCTC 

50i ^otS^ ACTTTTGCTG cattccataa gctttggtat gttatgtctc 
551 cataggcaat tgtctgaaaa tattttttaa atttcccgtt Igatttcatc 
601 tttgacccac tggttgtttc gaagcatagg actgggtatg gtggctcaca 
651 cctataatct cagcactttg ggaggccgag gtgggcagat cacctgaggt 

701 SJSI^ AGACCGCCTG ACCAACATGG TGAACCTCGT CTCTACTAAJ 
751 AATACAAAAT TAGTCGCGCG TGGTGGCACG TGCCTGTAAT CCCAGCTACT 
801 TGGGAGGCTG AGACAGGA6A ATCGCTTGAA TCCAGGA66C AGAGGTTGCA 
851 GTGAGCCAAG ATTGCACCAC TGCACTCCAG ATGGGGCAAT AAGAGCGCAA 
901 CTTTGTCTCA AAGAAAAAAA AAA6CGTGTT GTTTAATTTC CACATATTTG 
951 TGAATTTTCC AAGATTCCTC CTGTCATTGA TTTCTAGCTT CATATTATTA 
1001 TGGTCTGAAA GAATATTAAT ATGATTTCAA TCTTCTTAAA JTTAAGGCTT 
1051 GTTTTTTGGA CTAGCATATG GTCTATTCTA GAGAATGTTT CAAOTGTGTT 
1101 AGAGAAAAAA TGTGTATTCT 6TTTCTGTTG AATGGAAAGT TCTGTATATA 
1151 TCTGTTAGGT TCATTTGGTT TAAAGTGCAA TTCAAGTTCA 1^^^^ 
1201 ATTTTCTCTC TAGTTGCTCT ATCCATTGTT GAAAGTGGGA TATTGACCCT 
1251 CCTACTATTG TGTTGCTATC TATTTCTCCC TTCATGGCCA TTAATATTAG 
1301 TTGTATGTAT TTAGGTGCTC CAATTTTGCA TGCATATATA TTTACA6TTG 
1351 TCTOTCrTG ATG AATTGA C CCCTTTATTA TTAAACAATG ATCTTCTCT6 
1401 TCTCTTGTGA CAGTTTTTGA CTTGAAGCCT ATTTGTTACA TAATTGTTAA 
1451 AGGAAAAGTC TGTAACAAGG AGGTAAAAGG AGAAGCCTAG ATAATACAAT 
1501 ACTGAAATGT TGCCATCCAT TTAAAAATGT TACTTTAAAA ATTT6AATGT 
1551 ATTAAAAGAT AATGTTGCCC TACCCCACAG TTCCATTTCC AGTAGCAACC 
1601 ACAGATGATA GTTTGTGTAT GCTTCTGAAA AATTGGAAGT TTTAAAAATA 
1651 TGCATATTTC TTATTATAAA AGCAATACAA ACTCATCGAG ATGTGTAAAA 
1701 GAAAATACAG CCAGTGTAAA AATTAGCAAT ATTTCACAAA CCCACAACTC 
1751 AAGGGACAGT GCTCTTCGAC TGGACTCTGC CCCATGCCCA AGATCAATGC 
itU^t Zf^Z^^^ jQr^*rr-rr^QQ ACTrrrrAGC GCCCAGGAAC ATAGTCCTTC 
1851 CAGCAGTGGC AGTAATAGGT CGCCAGGTGG TGCTGTGGAG CAGAGCTCCG 
1901 GAGCTCAGTG AGAAAAAAGG CGCGGCCGCT CAAGGGAGCA CGTGACCTCG 
1951 GCCTCTGGCG TGGGCGGTGG GATCACGTGA TGAGGTCCGG AAGCGGCTGC 

2001 cg^SEaSaX AGGAGGATGG cgaggggctg atactgaacc cgggaagggt 

2051 GGGCTGTGCT GAAGCCAGAG CCGGAGCCGG AGCTGGGGCC AGAACCCGAG 
2101 CAGTGAGTTC CTCCACTGAC GAGTTCCGGC TGGCGGC6CT C6CCGCCTTG 

2151 ^^^iSc Sctcgcctt CCTCCCGGCG tggcagatgc tccaggtcag 
2201 gcactggatc cgcccgggct gtgggtccgc gactccttgg cgtccccggg 
2251 ccgcagctgc ggtacgacgc tgacacccct ctgtgaattg ggcgaagcgt 
2301 ggagatccct tgtccctcgc gctatctccc ttgacctcgt ggggttggga 

2351 tScSS JfGfTTGACr 6ACAGGTGGG GGAAACTGGG GTAGATGGTG 
2401 AAGATAACCC AAAGGACCAT CTAGGGCGTC J-TTCACGCTT CGCACAGGTC 
2451 TCCCCGTTTC CAGCAAATGT CTTGCCCGCT GCGGGAGCGC TGCTTGA6AC 
2501 AGGCTCATAA TGGGTCTTTG GGTCAGAACT GCAAGGACGC TG6GAAGTCG 
2551 TCTGGTGCAG CTCCCTCCTA GGACAGITGG AGAAACTGAG CCCTTACTCC 
2601 GGGAAGGGGT AAGGGCTTGC CTAAGGTCAT CCAGTGAGTT AATCGGAGAC 
2651 CCGGAGACCT GCGACTAGAA TGCAAATGTT CCTAAGCTTC AGCAGCTGTT 
2701 TGCTTTTCGC CACACCGCCT CCTGCGGGAA ACTTCACCTG TGAAAAGGCA 
2751 CTCCTTTCTG TCCCTTTCTC TTTTAGTCCT CTCCCTTTTT AGCTGTCTGC 
2801 ATTTTCCACC GCTGGGGTTG GATTTGCTCT GGGTGTGGTT CCCTGTTTGT 
2851 TCATTATTTT TCTGCAAACT CATCCTTCTG TAGGTTTGGT TTCTAACCTT 
2901 CCTGCATTCT ATGTAAGTCA CACCAAAATA TGAAATATGA ATC6GAATGT 
2951 GCTTCTGGGA AGATAGGTGG CTGAGCCGAG G TTGTGG AGA GCCCTGACGT 
3001 TAAOTGAAG AATGTAAAGA CCTTTGCTTT ATTTTT^ li^^TTS?? 
3051 GATTTGGGAT TGCTTATTTG GATGGACGTT TTGCAGTTAT TTGAATTTTG 
3101 CTGAAGATAG CATCATGGTG CAATGGACAG AACAGAGATT G6GGAATCAG 
3151 GATATTTTGT CCTAGCTCTG CCGCTTACCT GGCAACCTTA AGTGACTCGC 
3201 GTTTGGGTTT CTCAGTCTAG ACAGTGATGG AATTGAATTC T TAAGGGC CC 
3251 CTTCTGCTGT GATCTGGATG TTGTGCATCT TTCTAGGCTT GTTrTTTTGT 
3301 TTGTTTGTTT TTAAATAGAG ATGAGGTCTC ACTATGCTGC CCAGGCTGAT 
3351 CTCAAACTCC TGGGCTCAAG TGATCCTCCC ACCTTGGCCT CCCAAAGTAT 
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CATCTCAAAT CTTTCATrTG ACTCATCC^ ^g^J 
6351 GAGTCAGCCT TTGTOTOTQG GCCCTGOTTT |CTeAtt.^^ AnTAGTGAT 
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6801 ATTGTTTCAG AATAACGGAT GACACTTTTA GCTTGCAAAC AAGGGGCGCC 
6851 AATGCGTGAA TTCTGGTAGG AGGTGAGGCC TAGGGTGTAC CTATCATAAT 
6901 AAGATCATAT ATTTnTGTA GTGCTTTATA TAAATCTACC TATAATCAAG 
6951 ATTACCTAGG AAGCTAGTTA AAAATAAAAC GCCTCTT6CC T6TAATCCCA 
7001 TCACTTTGAG AGGCTGAGAC AGGTGGATCC CTTGAGGTCA A6AGTTTGAG 
7051 ACCAGCCTGG CCAACACGGA GAAACTCCAT CTCTACTAAA AACACAAAAA 
7101 ATTATCT6GG CATGGTGATG GACGCCTGTA ATCCCAGCCA CTCGGGAGGC 
7151 TGAGGCAGGA GAATCGCTTG AACCCAGGAG GCGGAG6TTG CAGT6AGCCA 
7201 AGATCACACC ATTGCACTCC AGCCTGGGCA ACAGAGGGAG ACACCATCTC 
7251 AAAAAAAAAA AAAGAAGACA AAAAGACAAA AACAACAACA AAAAAACATA 
7301 GGCTGGGCAT GGTGACTCAT GCCTGTAATC CCAGCACTTT GGGAGGCCAA 
7351 GGTGGGTGGA CCACCTGAGG TCAAGAGTTT GACACCAGCC TGGCCAACAT 
7401 GATGAAACCC CGTCTCTACT AAAAATAGAA AAAAATTAGC CAGTTGTGGT 
7451 6GCGCATGTC CGTAATCCCA GCTACTCGGG AGGCTAAGAC AGGA6AATTG 
7501 CTTGAACCTG GGAGGCGGAG GTTGCAGC6A GCCAAGATCG CACCACTGCA 
7551 CTCCAGCCTG GGCAACAAGA ATGAAACTCC ATCTCCAOTA AATAAATTAA 
7601 AATAAATAAA TAAAA TAAAA TAAAAT AAAA TGCTAAGGTG GAATCAAGTT 
7651 GGGCCCAGAA ATCTATTTTT TTTTTCCTTG ACGTATGTTT CATTTAACCC 
7701 AATATATCCC AGATATTATC ATTGCAATAT ATAATCAGTA TAAAGATTAT 
7751 TAATTCATGG GATATTTCAC AATTTTTTTG TTACCAGTTC ATTGAAATCT 
7801 AGTGTGCACA TTTCAATTTT ACCCAAGTGT ATTTCAAGTG TAAGATAGCT 
7851 ATTTATGGCT AGTGGTTACf GTACTGGATG CTACAACTTC AGAATATGTT 
7901 ACCATCTATT GATCTTAATC CTCCTTTATT TTGAACAAAC CCAGTCACTA 
7951 AAAAATTGAA ATTGGAATCC TGAAACTTTA GAAGTGAAAG TGTACTTAGA 
8001 AATCATCTAA TGCAGTTTTC TCAATTCTAT ATCAAAATAA GAAAACAACT 
8051 TTGGGATTAG AATGACAGCC AGATTATGTT CTCCTGAGTC CTGAATCCCA 
8101 TGCTGTTAAA ATGGGAACAT TAGCATTTGA ATTTATTAGA AAMTTTCTG 
8151 GCCTTGCCTT AAAAAAAAAA AATCACTGTA GAATTCCCCT TAAAATTGCC 
8201 CACTTCTGAA AA ATTTAA CA CCTACAAATT TTT/^TTTTA AAAATAGAAT 
8251 AAAATTTATT TTATTTTTAA AAATAAAAAT TCA6TTTGCA CATACATTTT 
8301 CCATATTGCA TCCGTTGCAC AAAGTGATTC CACCTGCTCA JlTrTAGTGC 
8351 CCATCTAAAA ATGGCATATT TTGTAGATTG AAGAGCAACA CTTGTCTATT 
8401 TATACAGCTA AAACAATAGT TACATAAGGA AAAAAAAGGA ATGTTTTAAG 
R4?l GTTTGTACAC TTAAATTTTT I I I I I I I I I I TTTTTTTTTG GCCATCAAAC 
8501 ^cS^^ ™ACTCA GTTGCrCACT CTTCTGAGTC TAAATATCTA 
8551 ATGGAGATTT GGACTTTGTG TTCTGTTTAT TGTCCTCAGT AATCTGAAGG 
tItTV:^^^ ^^r-AA^i-r rAr-ATArrTAC AArrCTCATT TAGACAGTTA 
8651 2^ctActX ^fAX^^CTC fc^TAGaGCG GGAACTGGCA ATTGCAGCAA 
8701 TAGACTTGGC TATCAGATTT CATCAAAGGG AGCCTAAGGG CAGTGTG6CC 
8751 ATGGATGCCA GCACTCATGG GGACAGACAG AGAGCAGGAG GAGGAGGCCT 
8801 TGGTTTCCAA AAAGAGCCAT AGAAAGAACT CCGGGGAGTG GCTCTGCCCA 
8851 CTGTCTGATG CTTGAATCCT TACATAACTG CTCTGAGAAA GGGCTTTTGC 
8901 TTGGATTTTr TCAGGGATAA GGGAACAGGC TTTCTCCCAG A6TGATCTGT 
8951 TCTATTTGGA ACAGATCTGT CTTTGATAGA AAGTTCTTCC TTACACCTAG 
9001 JSaAAATCA GCCCTCTTGA CTCrCCACGT ACTGATCCTA GCCCTGCCTG 
9051 ACCTTTGAGG CCCCAAATAA CAAGTCTAAT CCATGTGACA GCMMM.. 

qlol 1 1 i n i i i i i ttgagacgga gtctcgctct gtcgcccagg ctggagtgca 
9151 SJJSSIt ctSSgctcac tgcaagctcc gcctcccggg ttcacaccat 
9201 tctcctgcct c agcctcc cg agtagctggg actacaggcg cccgctacca 

9251 CGCCCGGCTA ATTTTTTGTA TTTTTAGTAG AGACGGCTTT T^GACAGT 
9301 TTTTGTACCC CTCAAGTTGC TAGGTGGAAC CTTCTCAGTG CTTTCAACCA 
9351 TTCCTCATTT AGTTGGTTTC CTACCCCTCTr TGATCCTAGT TCT6ACCCCT 
9401 GGATATACCA CAATTTGTCA TTATCCCCTT TATAGCATGC TGCCTGGAAG 
9451 AGMCACATT ATCTGGCAAT TCTGAGTTGT GTAACATGTA CCCATGTGTA 
9501 A^CT^Sl TGTAGAGTGG ATGGAGCAGA TGTCCACTTA CTGGCCTGTA 
9551 GAGTGGATGG AGCCGAGGTC CACTTACTAG CATGTGGGAT GGATGGAGCC 
960i gatctctI^ CATGATTCTC TCATGTCCTA atgcaaccta gaattgtgtt 
9651 ggtttatttg gcatcttgga tta tattatt gcctt ctgtt gagcttatca 
9701 tcaaccagaa ctcccaagca gaattttttt tttctgtttt caatatgcat 
9751 gcagttgctt agccacatct tccatatccc gcaggcttat tggaccttaa 
9801 atctatagat ttcag cttct tgttgagaag ttttcttagc aatattctgc 
9851 tgtgggtaac ttcagttttt atacacaaca taaagaagtc tctctgcagt 
9901 gtttgagata aattgaacat ctgtaccaag tagacaacag agaggtttct 

9951 CGGTTGCTAG GGAAGGATTG GGCAATTAAT AAGTCCCTGT ATTCCATCCT 
10001 TTCACCTTCA GTAATATATA GGTGTCAACC JAAAGGAAGA AGTTGAGACA 
10051 CAAAATGCAA TTTTTAACAG TTTACTTGAA CTGTTTACTT GAACCAAGTG 

ffi ^l^^^l ^"C^C^G^ C^-^ CS"--- 
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10201 ACTGATACAA AGTGACTTGA CAGGAATTCT CATCACrTTTA CAGAAATAGC 
10251 ATGGATTATT GATGGCCTGT ACATTGTTGG ACTATAGGGT AT6AGTTATG 
10301 ATGTCCAGTG TTAGCATTTT ATGACTTAGT GGTGTCAGTT AGTCTA6AAC 
10351 CCACATAGCA AGTGGCTTCA AGAGGTAATT ATTTAACTCA AGG6GGGAGT 
10401 a^TGAC TGCTCTCACA TTTTAGTGCC TCrCTGGACC CGTAATTTAA 
10451 AGG6ATTCCT CAGATAAAAA GTrTOTTTC TTTCTCACAA GATTCACTTG 
10501 GAAGGTTCTA T CTTCAGAT G CTTTTGGTTT 6TTGGAAGGG ACTAGAAATT 
10551 GGCAGCTTTT TCTTTTTTTC AGTAGAGGCA GGGTCTCACT ATGGTGCCCA 
10601 GGCTGaiSr GAACTCCTGG GCCCAAGTGA TCCTCCCACC TCAGCCTCCC 
10651 CAAATGCTGG GATTACAGGT GTGAGACACT GCACTCAGCT 6CTGTTT6CA 
10701 TAAATAATTA TGTTCATTGA CACCTAGAAT ATTAGTGCTA GAGGGAGTTG 

10751 I^t^-m lAGrrTATGC cccatgcttt ttgcacattt gaaaatggtt 
10801 cacaggtact aagcaaactg ttgacagagg taggcttggc gcctgggcct 
10851 ctctaaactg atttacgagc tt atacctgt atagcaagag 

10901 gttacaatgc tggtattaag atacttcaga gaiiumii IIctcccggc 
10951 cctctagtga gtttaattgc cccagagctg gttggcgtcc XI^I^S 
11001 ctagctcatg agtaaatgaa gctctcatag atttttagcc aagtggctct 
11051 ggcaatgaag ctaggcagga tcgtctctgg gatttccagg tcctttgctg 
11101 gcattttgcc aggtacttcc cttgtgagat agcttggggg tccttcctac 
11151 attgcaattg ttgagagaaa atgcgatctc ccgtggatct ctctggtgcc 
11201 a^Ictggggt gtttccaaag gagtaccctg gcactggacc taaggagagc 
11251 cttcggcgga gcaccatcct ctggcaggtg gtgctgggtg tcggggcagg 
11301 gtggggtgct gtggcagcag ttggaggtcc tgtctcctct caaggtagct 

11351 GAGATAGAGT GCCCAGGCTT AAGGTGGGCA TCCAGCCACA TGCCGGA6GA 
11401 CAGTCTGACG GGCAAGTAGC TGTGCCAGTC TGCCAAGTGT CGGGAGGATT 
11451 TTTGTCATTT TTTATATTAA TGTACCCTTT TTTTGTCACT T6GTTCTTGA 
11501 AGA6CAGGAA GTTGACTCTT TCACTGTGCT GTAATACTCT CTCATAGCAA 
11551 CTGGGACTCT GTAGAGTGGT TGTTTTCAGA TTCTGACAGG GGTCAGGAGA 
11601 TAACGTTTTC TGCTTGGTAC TACCTAGCTG TTGCAGG6CA GGTTACTTCA 
11651 TCTTTTAGCA TTGATTCTTC ATCTTTAAAG TAAGGGGCTT AGAGTGACCT 

U701 g^gSS atttagcaat ggtctctagg attctagggg cc^c^cgc 
11751 tcccaaaatg tgtccttatt gtgtatcttt aagaagccct tgctcttctc 
11801 ttttgtgtag tattaatagt attcctgagt aaatccaccc aggggacacc 
11851 actctcacca ccctcccaac acattgaaag gacatttttt tctctcacca 
11901 ttttaaaaat gagcacatct ataaaaataa aaaggaagaa gagttgtgga 
11951 tgggagatgt tgagacgagg gccagggtga gcacctttca gtttcctggt 

.^^l^ ^^^V^/-A r^r-rxTTCA GCTArrATTT CTCAGTACTC AGGTTGGCAG 
12051 ^^G^ GTO^AGGGT ^^GGCTATAA GAGTTACTGG TGGCCTCCAG 
12101 AGAGTATAGG ATCAGCCTGT GGTCACAGCA GAGAGAAAGA GAACGGCATG 
12151 TGTGGCTCTG 6GATTTGGTG GGAGTTTCAG CAGAATTGGA T6ATCCAGAG 
12201 GGGATTTCTG TTTTCTTTTT TTTTGTTGTT T6TTTGTTTT TTTTTTTGAG 
12251 ATGGAGTTTC GCTCTTGTCG CCCAGGCTGG AGTGCAATGG CACGATCTCG 
17^01 Gcf^CCACA ACCTCCGCCT CCCGGTTTCA AGCGATTCTC CTGCCTCAGC 
12351 C^JcCGAGTA GCTGGGATTA CAGGCATGCA CCACCACGCC CGGCTAATTT 
12401 TGTATTTTTA GTAGAGACGG GGTTTCTCCA TGTTACTCAG GCTGGTCTTG 

12451 Iactccggac CTCAGGTGAT CCGCCCGCCT tggcctccca aagtgctggg 

12501 GTTGCAGGCG TGAGCCACCA CACCCAGCCC AGAGAGGATT TCTTGAGTGA 
12551 AATGTGTTCT CTATTGAAGG CAGAGGAAAA AGAGTATAGG AT6AGAAATA 
12601 CCCAGATTTC CATCCCCCCA AGAGCTTGTA CATATATAGA TATACGTGTG 
12651 TATATTGTAT ACGTGTATAA TATAGTAACA TACACCGTGT ATATACGTAT 

V^Ol ISXIaI AWATATGG gatgatttat atatatatat atacagcagc 
12751 acgattgaac tattgcacaa ggtccaagac attatctcag aaaggagtag 

12801 ATAATCCTGA CCTAAGGAAT AGGGAATGCG GAATTCCAGG AAGCACTTCT 
12851 CTTTciTTTT CCCCCACTCC TCCCAAGCAG TGCCTCACTT CTGCCTTGTC 
12901 TAGCTGTACT CCGGAAAATT AAGAAATTTA TGAGTGTAGC ACCACGTATA 

12951 C^TGGGAA GGATGGGAGT cagaagtcaa gtgaactcag cccgcctctg 
13001 tgtactttgc acttttccat ttcccttggt accaggcact ttcatactta 

13051 ATCCATAGTG GAGCTGTCAC AGTGAGCAAC TCTGACAATG ACAGOTCTA 
13101 CCCCAGAGGC CACCCCAAAC ATGGAGCTAA AGGCTCCAGC TGCAG6AGGT 
13151 CTTAATGCTG GCCCTGTCCC CCCAGCTGCC ATGTCCACGC AGAGACTTCG 
13201 GAATGAAGAC TACCACGACT ACAGCTCCAC GGACGTGAGC CCTGAGGAGA 
13251 ^JS^C^Si AGGCCTCAAC AACCTCTCCT CCCCGGGCTC CTACCAGCGC 
13301 TTTGGTCAAA GCAATAGCAC AACGTGAGTA GCTGTTACCT TCTCCTCTCC 
13351 TGGGTGGGAT TCGTGTTCCT AAGCCTCCCT TGGACTTATT TTTCCCCCCA 
13401 ATTTCATCAG TCCTCCACTT TACAGATGAA GGTCAGCAGT GAAGA6ATTG 
13451 GGCGAGTGAC T GCGCTGA GA "rrTGCCTTCC TGGGCTGCCA CTCTCTAGGC 

imi TII^- ^7c?^SS^ ^^^^ 
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13601 TAATTTAAAA GTGTGTCATC TGTGCTAGAA CCCCAAATAA TTTCCAAGCA 
13651 TAATCGGAAG CTTCOTTGC AAAGTCTCCC CCCGAATTCT GCCCCATCAC 
13701 CAAATCAGTA TTCATTTGAC TGAAGAAGTG GGAAGAGAGA AGAA TTAA CT 
13751 TCTGCACTTA AAAAATTCAG GGTTGGTAGG AAAGGAAAGA TAGACTTTGC 
13801 ATTCTCCAAA GAGGGCTTAA TCTCTT6TCT CCAGAA ACTG GGACCCCAGA 
13851 CTCATTTGGG CTGAGTTTGG CCC6CTTCAG GTCTCACfTT CCCCAAATGT 
13901 AAAGAAAAAT TGAGGACTCC ACCACAAAGC TATG CTGGCT GTGTGGGGCT 
13951 CACCACTTGA ATTAGAAAAT TCAGAGGAAG TTTTGCTACT CCATTGAGTT 
14001 AGTTTCCCAG CTACTCCTGA TTTCAGCAGA CCTCTGACTT TTCTCTGTGT 
14051 CCCAGCATCT CAGCTTTTGC AGTCCTGTTT ATTCCTCAAG CTTA6CTATT 
14101 AC C I I I ICTG T GI I I I LI IG TGGACTGAGT GTGACTTACT GAGAGATCCT 
14151 TCATGTCCTA GACTTATGCC ATTCCTGATG ACTGCCCAAG CGGACCATGG 
14201 AAGCTTCTGG GCTCATCACT 6GAGAA6CTC CCTCTGCCTG CACTGTCTGC 
14251 TGGTACAGGG CATTTTCTCT TGCGAACTGG GGTGGAACTA GAAGAATGTC 
14301 TGTCCACATT CCTGGCCCGT CACCACCACT AGC TGAT TTC TATGCCTCAG 
14351 GCTGGAAGTA CTCAACCAGT CCTCTAAGAT TCTGTTTCTG TAGCTTATTT 
14401 CTCAGGGGTA TGCTTTTGTA GATTCCCCAT TAGCCTGCAG TGGGAGTTAG 
14451 CTGGTGGTAG ATTGCTTAGA GCACAGCTGG CAGCAGTGTG GATCACCCTG 
14501 CCCCTCTTTC CTCCAACCTT ATCAGCATTG GCAGCCCCCA TGCAGAAGCA 
14551 TCTCCACACA CAGCCAATGG CATGTGATGG CTTCCCTTCA GAGGTCATGC 
14601 TTGTTATCGT AAGATACTTC TAAGCTTCCT TCTCTGTAGT TTCCTTTGCA 
14651 GTTTTTGCTC CTTTTTGATC TCAGATATCA ACTTGTCTAA GCAATATTTA 
14701 GCAGATGAGG TCTGGATTTT TATGTTTATA GAGACATCTC TGAAGCTCAA 
14751 AACCTACCAA CTAGCAACTT TAGGATAGTA GCTCATAGGT TTTGGACAAA 
14801 ATTATGTCCT TGTTTCTTGG AAATCGAACA A ATCA GAAGA TA CCTT CCTC 
14851 AGGCTTGTAT TGTGACATTT TCCAGGGTAT ACTTTGTTCC GAGTTTCCCT 
14901 TCCTGCCTTG ATGTTGTGAT ACAGTGTAGG TGACCAGGGA AGCCTATCTG 
14951 TAGTTGATGG CAGGTATTAC AGTCCCATCA CAGGTGGTAC AAGATAAAGT 
15001 AATTTGCTGG GGCTTAGAGG ACTGGTTGAG TACTTCCAGC CTGGGGCATA 
15051 GGATCCACGC AAGGATTTAT ATAGAAAACA TGCCAGGTAT GATTAAGGTA 
15101 GAGGTTGATT TGGAGGACCT TCTTAACCTA AATT AATATT TTAATATGTC 
15151 GGAAGTGTTA GAGACAAGTT TTTGAGCTGG GTTCCTTTTA TATTTCTGGT 
15201 TTGCCCCACC CTTTTATCTA GTTTGCGCAA GGAACAAAAT ACATGGAAGT 
15251 ACTTCTACAC CTACTGCACA TATGCATGCA CACACCTGGC TCTTCTA6CA 
15301 AGTCAAGGGC TCAGCAAAAA CCCCTAGTTA GG GGGTG CAA ATAGGA ACCC 
15351 CAAACACTTC CATGAGTTTC ATGGGTTACT TCCTTTTATT nil iGAGAC 
f^narrTCTTCr TrTGTTGrrr AGGCTGGAGT GCACTGGCAC AATCATGGCT 
15451 CACTGCAACC TCCATCfcCT GGGCTCAAGT GATCCTCCCA CCTT AGTTTC 
15501 CTAAGTAGCT GAGACTACAG GCA TGCT CCT GGCTACTTTT TGTAn i ii i 
15551 I I I I I H I I I TGTAGAGATA GGGTTTTGCT ATGTTGCCCA CTTAGTCTTA 
15601 AACGCCTGGG CTCAAGTGAT CCGCCTGCCT CGGCCTCCCA AAGTGCTTGG 
15651 ATTATAGGCA TGAGCACCAT GCCTGACCTG TGAATTATTT CTTAGTGTGT 
15701 TCAGTGAGGT TATTTACTAA CACTTGATGT TAC CAAGC TA TTGACTGCTT 
15751 CGAAGACAGC CTCATTTTAT GCTGTTGGGC AGATTTTTCT TCTTGTTGCC 
15801 CCTCTGAGTT CCATTATATA TATCAAGCCT CCGTGCTTCT TCCCCATGCA 
15851 AACTGAAACC AGCAGACTGA AACTGGCTCT CTAAAGGTGA GCTGGAGTAG 
15901 TCATTTGCAA AATGTGGTCT GCACACTTTG TGGGCTTCCC AAGACCATTT 
15951 CAAGAAGTCT ATGAGGCTAA AACTCTCTTC ATAATAATAC TAAGATGTTA 
16001 TCTG C I I I II CACTTGTGGA TATTTGCACT TATAATGTAG AAGCAATGGT 
16051 GGGTAAAATT ACACTGTAGA ACGAATCAAG GCAGTGGCAC CAAATTATAC 
16101 TAGTTGTCGT TGT A I I I I IC ACTGCCACAC ATGCGCAAAG AAAAAAGCCC 
16151 TTTGCACTTA AGAATGTCTT TGATGAAACT GTAGGATTAC TAATATTTAA 
16201 AAATTTGAGA CCCTTCAGTA TAGGTCTTTA ATATTCTGTG TGGCAAAATG 
16251 GGAAGTATGC ATGAAGTACT TCTATGAGTA CCAAAATATG TTACTTGTCT 
16301 TAAGGCAAAG ACCTCGAGTG ATTATATGAG TTGTCAACCA AACTTGCTGC 
16351 CI U I I 1 I I I TTTTCATAGA ACTAGAAAGA ACAACTAACA AACTGTAGGT 
16401 CATTCAGACC CGAGTACTTG TAAGACATTT TCTTGAAAAT GAAAGAAATC 
16451 AGCCCATCAC CTCAAGGAAA ACAATA6ATA ATACATATCT GTTGCCCAGA 
16501 ATAAAATTCA AGCTTTCAAG CAAAATTAGG AAAAAAAACC AACTTGTATC 
16551 CAGTACCATG AGCTTGATAG CCCCTC TACT TGAAGACTTT TCTGATGAGA 
16601 TTAGTGGTGA TATTAACAAA TATGACTTTT TGATATTATT AATATACAAT 
16651 GAAGATGTTA ACATTTGGAA GATCTGTGTA AACTCAACCA AAGTATGATG 
16701 TTAGGAATTC TGCATGGGTA AAAGATCCAT TGAAAGAGCA AGATCACCAA 
16751 TGGATTTTTT I I I I L I I I I I TTTTTGAGAC AGTCTTGCTC TGTCACCCAG 
16801 GCTGGAGTGC AGTGGCACAA TCTTGGCTCA CTGCAACCTC TGCCTCCCGG 
16851 ATTCAAGCGA TTCTTCTGCC TCAGCCTCCC GAGTA GCTGG 6ATTACAGGT 
16901 GCCTGCCACC ACTCCCAGCT AATTTTTATA TTTTTAGTAG AGACGGGGTT 
16951 TCGCCATGTT GGCCA6GATA GTCTCAATCT CTTGACCTCA TGATCTGCCC 
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17001 GCCTTGGCCT CCCAAAGTGC TGGGATTACA GGCATGA6CC ACTGCACCTG 
17051 GCCTGACTTT 11 I HI I I I 1 TAAATACTAA ATGTATCAGG GACTTCTGGC 
17051 ^CTGACTU ^1^^ TTCA CTTTGT ATCTTTCTGT 

17151 Sg^ GGGScflcTG TTATTATTAT TATTATinTT TAATTTCCTC 
17201 TGTTCTCTTA CCAGTGTTTG TCC GTCATTG TTTGGTTTGT CATCCTCTGT 

17251 T^GTTTTG GGATCTGAGT CM I ™AGATG GAGTCTCCCT 

17301 CTATTGCCTA GGCTGGAGTA CAGTGGCACG ATCTTAACTC ACTGCAACCT 
17351 CTGCCTCCCG GGTTCAAGCA ATTCTCCTAC CTTAACCTCC T GAGAA GCTG 
17401 G6ATTACAGG CACATGCCGC TATGCCT6GC TAATTTCTGT ATTTTTAGTA 
17451 GAGACGGGGT TTCGCCTTGT TGGCCAGGCT GGTCTCGAAC TCCTGACCTC 
17501 AGGTGATCCA CCGCTTCGGC CTCCCAAAGT AGTGGGATTA TAGGCATGAG 
17551 CCACTGTGCC TGGCCAGGTC TGAGCCTTTA CAGTGGTCAG TTCAGTGGTT 

17601 A^S^c JSaatacac TTGGAAAGGA tagagtgtct gaagagagtt 

17651 GGAGCACCCC TCTGGTCTAA TCTCTGAGAG AAGGGATTCT CAGAAATGTC 

17701 S^^TC^k ^Sttacagc acagtggata agagggggag ctctggagtc 

17751 AGACTGCCCA AATTTGAATC CTGCCCCAGC CCTTTACTAG GTATGT6ACC 
17R01 TTGAGCAAAC TGCTTCATCA TCTATAAGAT AAAATCTTAC AGGGTTGTTG 

178^ Vg^^^ I?^taat gcatataagc actgagatcc taataaaagt 
17901 taactgtcat ggttatcatt tccttggctg tcttccactt cagatggttc 

17951 CAGACCTTGA TCCACCTGTT AAAAGGCAAC ATTGGCACAG GACTCCTGGG 
18001 ACTCCCTCrc GCGGTGAAAA ATGCAGGCAT CGTGG TAAGG GTCTGCATCA 
18051 GTGGAGAGGA GTGGTGACAA ATTTTAGGAG GTAGCTTTTT GTTGTT6TTA 
18101 AA^TCTACTT GCTTTAAAAC ATTTTAAATA GAGAAGCATT TTAAAAAAAT 
18151 CAGTTGACAA ZaAGCGGAAT TCAGACATTC ATTCACTTAA AGATATTTAT 

18201 TGAGAGTGTT CTGTGCGTTA ggcactgttc taagctctta gaatacatca 

1R7qi fiTGAATTAAA TATTCCTGCC CTCATGGAGC TTACTTCATG GTGGAGAGGA 

illoi T^^^ Iggctcgagc AGTTTCTGTC AATAATATGA actaatgagt 
18351 Sacaga tgtctgccca ttttctacag tctcccatgc cctgttccta 

18401 AATGGCCAAC TGCAAGAATC TTATGTCTTC TTTTTGTGAT TTACCTCCAG 
18451 ^^GACTGCCT 6CCCAAAGCC ATTCTGGTTT CTTTCGGAGT T6AAGA6AGA 
18501 CTCAGAGATG TGGGTTGCCC TTAGCTAAGT 6CA6TCTTTC TTGATCTGGC 
18551 ATTGCTGTAA AGATAACTTA CCCGTCTCAC CTCACATCCC T^AGCCCAGC 
IRfini TCTTCCCACA GTCACAGGAG CCTTCTATTC TGCTGATGTG CACCAGTCTT 
18651 GGAACNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
18701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
18751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
Wol} SSSmSmmmmm SmmmmnEn NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
18851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNN^^ 
^8901 NnJJnNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1Rq51 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19001 KnKSnnK nSnNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19101 KnNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19251 KKnnKKnNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19301 KKKnSSnNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19401 KSKK SnSJJnNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
iq451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19501 SnNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNN^ 
19601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNN^^^^ 
19701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19751 SnSKUKKSnN SSSnKnNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNTGTTCT GTGCGTTAGG 
19851 CACTGTTCTA AGCTCTTAGA ATACATCAGT 6AATTAAATA TTCCTGCCCT 
19901 CATGGAGCTT ACTTCATGGT GGAGAGGATG TACTGAGATG GCTCGAGCAG 
IQQSl TTTCTGTCAA TAATATGAAT TAATGAGTTA GTTACAGTAT GTCTGCCCAT 
20001 ^TfcTA^ CTCCCATGCC CTGTTCCTAA ATGGCCAACT GCAAGAATCT 
20051 Wal™ fTTTGTGATT TACCTCCAGT TGACTGCCTG CCCAAAGCCA 
20101 TTCTGGTTTC TTTCGGAGTT GAAGAGAGAC TCAGAGATGT GGGTTGCCCT 
20151 TAGCTAAGTG CAGTCTTTCT TGATCTGGCA TTGCTGTAAA GATAACTTAC 

20201 JcSf^c ^SJatccct TAGCCCAGCT cttcccacag tcacaggagc 
20251 cttctattct gctgatgtgc accagtcttg gaacagactt atcttatgtc 

& SISSS ^c^S^^^A ^TSS^a^ Tc^T^^ 'c?5K 
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20401 CTGGGCCCAA AGTTTTGTG^ CTGAAAACTG CCTTAGTAGC TTTT TAATCC 
20451 TTTGTGAACT GAGTATCCAT TGGGTTCACT CCTAATTCTA CCTACTTTTC 
20501 TCTCTCTCTT TTGCCTGCAA TATCTGTCCC CAGATGGGTC CCATCAGCCT 
20551 GCTGATCATA GGCATCGTGG CCGTGCACTG CATGGGTATC CTGGT6AAAT 
20601 GTGCTCACCA OTCTGCCGC AGGT6AGAGC CCTCTGAGCC ACCTCTCAAG 
20651 TGACAGATTG TCCTTTTGGG TTCTGTTATC AACCCTGAAA ATGAGCACTG 
20701 ATGCAGACCA CTCTCAATTC TTTACACTGG CTGGAGGTAG CAGCTTATGA 
20751 TTGCAGCGTT TTCCTTTCCC T6GTTATTTT TGCGTTCTTT TCTGGCTCAT 
20801 TATCATCTGT TAAATTTACT TATGCCCAGT 6G6TACTACA TJCTAATTTC 
20851 ATGGGCGTTG TAATATTTAC CCCATTGAAA TGATTCTACC AGATGCrTCT 
20901 TAATTATAAT AAAAGTAACC ATCCTGTCGA CTGAATACTT CTGATCTTTG 
20951 AAAGCACGAG ATACAGGACT CAGAGTGGTA CCTCCAGGGT GAAAGATGGG 
21001 AACTGGCCCA GGTCTCAGTG GCTOTTTTG TTCTGTCATT GTCATTGTCT 
21051 AATCCACGTG CTCTGTCCTT CCTCTTCCCT CCTACTCTTC CAGGCTGAAT 
21101 AAATCCTTTG TGGATTATGG TGATACTGTG ATGTATGGAC TAGAATCCAG 
21151 CCCCTGCTCC TGGCTCCGGA ACCACGCACA CTGGGGAAGG JAACTGATTT 
21201 CCTCCTTCCT TTCAACTGTG GCCTCCCAGT GTGAGGCCTT CAGATGGGGA 
21251 GGTGCAACGT GGGAGACAGT GTAAAGCGTG GAAAGAGTGC TGTTTGGGTC 
21301 AGTTGCCTTG GGCTGTGGCT CAGCTCTGCT GGTAGTAAGC TGT6TGACCT 
21351 GGGGCTGGGT AACCCCTTTT TTCCTTGGGT TTTAGTTTTC TTATCA6GAA 
21401 AGCATGGGGC CTGGCCTGAA TGGTCTCTAG AGCCATTCCA GCTTTGGCGG 
21451 TCTATGACCA GTGATTGTTT TTGATTCACT CATTTOTCA ACAAATGTAT 
21501 TTAAGCACTA TCTTATAAAT GGAACAAAAC AGTTCTAGGT AAGAAGGGAA 
21551 GATTTCCT6A AGTAAATTAT GTGGTTCCTA CCCTCCAGAG GCTTGTAGTC 
21601 TGTGTAAGGA AAAAGAAATG TGGGAAGAGA AGCCGGGGAA CAAGATAAGA 
21651 GACCAGTAGT GGGAGACACC CATAAGAAGA AAGTGTCATG A GCTAG GAGT 
21701 ACACCCTCAG TGCTCAGAGA GAGAGGAACT TTAAAGATTC TCTTGTC6GC 
21751 TGTGCCAGAT GAGAAACGCA CATGAGAGAT AGGAGCAAAG AAGGCTTCAG 
21801 GAGAAGGTGA 6ATAAACTAG AGCAGGGCGT GGAGATGAGT TTGGAGGTGG 
21851 GAAGTATTTG CAAATTTCTC GTTATGGTAA CTCTTCAGTG TTTGGAGGGA 
21901 AATATTATGT TTGTTTTCTA CATTTAAATG TAGGAAATTG ATACTATCAA 
21951 GGGCTAAAAA TTCTTAAAAA AAAAAAAAAA GAACCACATT AAAACTATGT 
22001 TCTCTAGAAA AGTTCCTTTT TGTTGTCATA GA6GAAACTT ACTTT^TIS 
22051 ATAGTCACCT TTATCCTGTG ATGCAGATTA TATAGTTCTT TTGGCCAAAT 
22101 TATTTTCTGT AACT6GGAGA AGCTAGATTG CCAGGTGACC ACCATGAGTT 
22151 GGGTGGTTGT TAATTCTTCC TTCCATTCTT TCTTACTACT TCCTTTCTTC 
22201 CGCCCTCCrr CCCTCCTTTC cttccttcct tttaataaaa tgtgtgctat 

22251 TTTAATGCGT GcfcATAGTA AAAACTTTGT TTTGATCAAG ATAGG ACATA 
22301 AAGTAAAAAG TGAAAGAAAA TTTTGGTCAC AGTTGCATGG GTAGOmTT 
22351 GGAATTTGCT GTATAAGTAG AAACATACAC ATGTTCTTAA AtMi ii iiGC 
22401 ACAGATTGAC CATACTATGT ATACTGTTTG GAAACTTGCT TTTTCCCCTT 
22451 AAACGTCTGA GACGTTTTTC TCTATCAGCA CATAGAGATT TAACACATTC 
22501 TTTTTAACTG CTGTGTAATG TTCCATTTAA GAACGGTCTA TAATTTAATC 
22551 ACTCTGCTTT TGATGA TCCT TTAGGTTGTT ACCAGCTGCT ATTGTTCAAC 
22601 CAGCAGTCTG TTTTTGGTAC ATCAGTTTCT 6TGTCCTTAA TGTGGGACTT 
22651 GGTTGGTTCT TATATCCAAG TTATAGAGAC AGTGAAGGGG ACTATTTCCT 
22701 TGTGTTTTAT GTCAAGGGCT CCCTGTAACT AACAAAAAAG TGTGAGATGG 
22751 GATAGGTGGG CAGATGTGTA GAGAGGATGC TAA6GGGCTG GGCAGTGGTC 
22801 ATGGTGTCTG TGCATGTGTC TCACCTCATG CAGCATTCCA GACGAGAAGC 
228S1 CAG6AAGGGG ACGTCGGAAA CCACACAGAT AGCACCTCCC TCACCTTCTT 
22901 CCCAATGCCC CAGACCAGTG GCACCTAGCA TGGTTTC7TC JCCTGCCAGG 
22951 GCATCTCGTC CTTGTCACTG CCAGGAAGGG TCTGTGATGG CTTGGGGAAA 
23001 AGCACTGTTA AAAAAACACT TAATGGGCAC AATGTACACT GTTTGGGTGA 
23051 TGGGTACACT AAACGCCCAG GCACTACCAC TATGCAGTAT ATCCATTTAA 
23101 CAAAACAGCA CTTGTACTCC CTAAATCTAT TAAAAAACAA AAACAAAAAA 
23151 CACCTCCCCT TCTGGGAGCA TTGCATTTGT ATTGTAACAG TCTrTGTATT 
23201 CCTTCCTTCC CCACCTCCAG ACGTGTTGTG GACTTCTTCC TGATTGTCAC 
23251 CCAGCTGGGA TTCTGCTGTG TCTATTTTGT GTTTCTGGCT GACMCTTTA 
23301 AACAG6TAGG CACCTGGTTA AAAAAGAAAA AAAAAAAAAA AACCAGAGCG 
23351 AGAATGGCAA AAGATGATTG AAGTTTTTGT TTAGGATTTT TTCCAAATCA 
23401 GCTTTTGTCA ACAAAAGAGT TAAAGTTTTC ATATTTTACA TAGATCTACG 
23451 TCTTCTATTT GATTCCCATG GAAAGAGCTC GGGCATAGAG AAACC6CCAC 
23501 ATGTCTTGTC GACCCTCCTG TCCTAGGTAC ATATGATCAA ACCTAGCTCA 
23551 GACAATTGGG TTGCTGATGA TAGTCGTGAA GTTCTCTAAA GATGGCTCAC 
23601 TGGCCACAGA TTCTAAAAGG CCTTGTTCAC ACACCTGAGC CTTTCCTCAG 
23651 GAACCTCTTC CAGCAGA6GA TCCACCGGCC TCTGTTGTTT GAGAGGTGTT 
23701 TCCGTTTTCT TCCTTCCCCT CATTCTAGGT GATA6AAGCG GCCAATGGGA 
23751 CCACCAATAA CTGCCACAAC AATGAGACGG TGATTCTGAC GCCTACCATG 
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ffi s ^ s ill 
™i ™ s s s 

Siiii '^T^^ Tc^^ ^-s- ?^^|| 
n^^cS ssxssss^ isss^^ 
HIi i^i^A- ^^^'c xXt^^ 

24401 tIw^I^IS totI^w a^?I^gtaca tgcataatat atattatata 

24451 ATGTATGTAA TATTTATATA TTATOTATAT ATATACATAA TATATATATG 
24501 TAAOTGGAAT GTAAATA6TT ATATOTTACT ACTGGTATGT CTAGATTAGA 
tJiiVi y^^lZf-^f TTGGGCCCTG TTGACATTTT GGGATGGATA AATTCTTTec 
?1lm tSJS^G TCCTOTGCAT TGTGGGGTGT TTAGCAGCAT CTCTGGTCTC 
24651 ^ctSS ^aS^^J JXIcCCTCCA TGAGTTATGA CAACCAAAAA 
ii7M TCTCTCCAGA CATTGCCAAA CCTTCCTGGG GGGCAAAATC GCCCCCCCAC 
IaIv! r?[?rrf^ rTGGTTiAGA CTTTTTTCAA TTAGATGGTr AATTCATGAT 

iJsoi ^^^^^ ^aSggaaaa TGTTAAGATT aaaataaaaa 

24851 ^MTTTTTC TAACCTGTAT TTAGATAAGT AATTCCTTAT CAACTCCA6T 
74q01 TAATTTTTAT TTGTCAAAAT TATAAATTCA CTTGTTCCTT GCCCTCACTT 
24951 I^SItcca GGCAAGTCTG TGGGGTGGCA tgagagagaa catctotata 

Sis ^gII^? iS^^A iSS^Gt 

1 iSi?c ^A^^si^^^ ^^.Jrc^A ^^^cSi^i 

S ^G^^^ ^^"c^S^t§ 
^l^m ArArScATG TAAGTATATT GAGAAATACA TTGAAATATA TIII U C 
25351 Am^^^ JS^AATmr fXAAGTTACA TGTrCATCTA CCTTGCTTTT 
75401 TTCCACCTTA AAAATGCCTT AGTGAGCCTT CCAGGTTAGT ATTCCTGGCT 

25451 cta^Stc[t S^TGTTAGTT gtcacattgt atcacagcaa ggagatttgc 
25501 tgccatttat ttaacaagtc ctcactcagt ggctatcagg ccatggataa 

lllfl ^^ISaT TAmCAGTA TTAAGACAGT GAGATGTTCT TATACATTCC 

25651 ^i^^ gIg^^ k7X\^G^ Tatg^tacca agttgttttt 
2S701 tcSmagct atttcctaat ttcatagaga ctcccttcca cagtatttaa 

?c7?i r^rrrr^TTT TTCCATCOT ACTAATACTG GATGTATTAG TATTATTTAT 

25801 ^l^V^c ™It^ wSXtatat TAGTATTTAT atatagtata 
25851 aTTa?a?tIg tattattttg gtcagtctga caggtgajta ttatctcatt 
25901 ttcatacagt ctgcttaaca gtgacccagt cacccactga tatagtttcc 

2SQ51 AGGAA6ACAG TGGCTCATAA AAAGCAGGAC TTCTTGTGCT AAG CAAATGA 
ilhl} rATTATTAAT TTAGATTAAC ATTTTGCTCT GTGAGTATTG ACTGTTTTTT 

26051 ^Vc^(5^l IS^^St^ gISJagata TTGACTCTTC tgcagaacta 
<^|V;yr !l? ?ataaaataa ATAGTTCTTC ACGTCCCCTT tttatgagac 
26151 TSI^ ^<^SS^ gcactataga cca^ttccat taccaaacaa 

rrAArrrxAA ATATTTACCC ATGATTTCAT GTTGTAATTA GAACTCTCAA 
26251 i^CCT^ St™C aJ?^ACACT AAAG^T TGAGTGAGGC 
7Mm CACCCCCATA GACAGACACA CCTGCTGCAT TTACTTTTAA ATCATTGCAG 
7fi«l CTAGTCGGGC TGTGGTGGTT GCCATGATCA GAGGGCTGGG ATAAGAATTT 
IfAM r^CTTCTTAT AGCGTcfcAG TCCCAACCAA CTGGTAGTAT CCATCCAGAG 

^^irrTTTAT GCATAGTACA ACCAGGACAC AGAGCAAT6T CTGCATAAGG 

7fi?m J^gSctgc tg^tttcttg agagcaattc tgagtcttcc tctgggctta 

26551 G^^^S^ (^(^T CAAATAOTGC CGTCTGCCTG GAGTACAGCA 
26601 Tg5S5SaGA GGnTGGCTG TGrTTTGATG TAGTCACTGC CCATAGTGTT 
^cfi^l rTAfTTTaCTT CATTTTGATG TGTCATACAG CTAAAGATGC TCCC I I 1 A^j^ a 
2670l tJ^?™^ ^^S^CC TCTGCGGCTT GTTACTACTG TTCTGTTTfG 
26751 GCATTGTGCC CCACTTACCA TGAGGATTCC cctactgttc aaIGTttctg 
9fiRni AATTTTTTCC ctaatcctaa gcatgtacat gactgttcct cttgcccctc 
26851 ^SIS^GC S^^ctSgt agcagaccaa ggtcttccac agagagcagg 

^coni TT-rrrrTrrr, TCTTCAGCAT GTGGAGTCTC AAATGGAACA GTTCTGGGCA 

lisi Sgtc^I SS^g^ gctcccaata aatgttttat cactgcatat 

n^Arti ^rTT-rr-rrrT r;AGATGTATT TTTTCATAGT TATAACAGTT TCAGGATTGC 
AfJIrrUS ctc^caatS ATGTGTACCT TTAACAGCAT TTTCTCAAAA 
???ni S^MTTGATA ATATGGTAAG ACCTCACTTA ATATCATTGA 

27151 ll^^C^A ^I^A^TAAAT GTATGTATAG CGAAATCAGT 
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27201 TTTTTTCTCA TCAATGTTAT AACAAAACAG CGTTGAAGGA AGTGACTGTA 
OT^^i rrrrlTTTCA CTTAAAGTCT CAGTTTCCAA GAACTTATTG ACGACAAGGG 
'27301 A^S^CT ^^SS i??SiGAGA TATGTTAATA ACGAGCTGAT 
27351 TTTAACATGT ATGTTTCTTT ATAAATTAAA CTTTTCTCAT T^AGTTGGTT 
27401 6GGTCAGTAG CAATCAGTAA GTATGTAGAA TAATACACTT CTTCTGCTGG 
27451 CCTCATTCCC ACAATATCCC CACATATGGA TTGTGAAATT CCCAGTCT6A 
77501 TACTTGAATC TGATCTGATG TATGAATAAG AGCAGGAGTC ATTCACTAAC 
27551 ^SaCAGATAG CACCTGTTTC CAATAACTTA GGTTACATrT GTGACTCAGG 
27601 AATAATTACA GGCCACTCTT GCTCTCAAGT CCCATTGTAA AGGAAAAATA 
27651 CCTATTACCC TGTCTTCATT CCAGGTATTG AAATGCTTCT TACAAAGGGA 
27701 TCTAACAGAT TTCTTAGCAG GGGCCCAGGG AAACACATTT ATTTAATTTT 
27751 TTTATTTTTT CAAAAGCAAT ATTACTGCTT TGAAATCTTT CAAAGTGAAG 
27801 GCTGTTATAG AGCTTAATAA TGGATCTCCT TTTACTTGCC TGAAATTATT 
27851 CTGAAGCCTG TTAAGAGCAT GCCCCGTATT ATCCAAATAG CCATACAGTT 
27901 Satcaattt TAAAACATTG TAAAAGGCTG TTTTAACATC aatttttatt 
27951 TTAATT6AAG CAACATACAC ATGTGGTTTA GAAAACCAAA TTGTAAAAAG 
28001 ACAGCAGCTT TGAATCCCTC CTCCCCACCC TGCCCCTTCC ACACAGTCTG 
28051 ^TACTGGAGA CTGTTGTTCG GTGGAGGATT TTGTGACTAT ACCTCT6TCT 
28101 TACTCAGGGT TCTCTAGAGG GACAGAACTA ATAGGATAGA JGTACATATA 
28151 TAGGGGAGTT TATTAAGGAA TATTAACTCA CGCAATCACA GGTTT^^^ 
28201 IcAGGCCCTC TGCAGGCTGA GGAGCAAGGA AGCCAGTCCA AGTCCCAAAG 
28251 CTGAAGAACT CGAGGTCTGA TGTTCAAGGG CAGGAAGTAT CCAGCACGGG 
28301 Saaagatgt AGCCTGGGAG GCTAAGCCAG TCTAGCCTTT tcacattctt 
28351 CTGCCTGCTT TTTAATCTGG CCACTGGCCG AGCTGGCAGC TGATTAGATT 
28401 GT6CCCATCC AGATTAAGGG TGGGTCTGCC TTTCCCAGTC CACTGACTCA 
28451 AATGTGGCAA CACCCTCACA AACACACGCA GGAACAATAC TTTGCCTCCT 
28501 tcactgcaJ^ CAAGTTGACA CTCAGTATTA ACCATCACAA cctcct tcct 
28551 waSaot- ta^Sjtcta cctgcagtta acagttgccc ™ctggc 

itlfU rri l M l I l A AAGCATTCTT TTTGCTCCTC CTCCCCACAT GTTCCAGCAC 
28651 J^^G ^^V^G TAATACTTTG AAAGTGCTCA AGTTCATTGA 
28701 TGAGAATTTT AAAAAGGAGA agaaaagaag aggaaaaagg aagagaacca 
7R751 ata-^Saaaat GTACCACTTT CTCTTCCCTT CCAGCTTTAT CTTTGATGTT 
lll^ ^l^^l PcAGAGTG AATATATAAA TTAAAATTAA AATTTTTTCT 



7RR?1 ''cS^pI^^ TC^Ag i^SJS^ACT TATCTCAATT 

28901 ^^^Imm I^tIgtgcat ACTGGCCAGG CGCAGTGGCC AACGCCTGTG 
||i ^^^SSTc ™gaggc CGAGGCAGGC AGATCACTTG AA^G 

29051 Tk^^ AGCC^G^^ ^GG^CgE aEctGTAGTC CTAGCCACTC 

29101 Ig^^ggctga ggcacgaaga attgcttgaa cccaggaggc agagggagot 
7Q1S1 t^ctgagc cgagatctcg ccactgcacg ccagcctggg tgacagaagg 
29201 tctmaSga tagatagata gatagataga tagatagata 

29251 S?Sataga atatatattc tgtggtttag ctgctatctt gttaattact 

79301 TAACTGATCC CTTGTTTGGA AGCACT TATA TT ui I iiCAG XTACTTTAI i 
29351 AAACAGCTTT GCTCAGTGTT TTTCATCTTT TGAl 1 1 1 11 f TCTACTTGAA 
mil ^S^^ GGGGATGGGA TTACCTAGTG AAAGAATGTG ACTCTrTTTA 
70451 TnrAAGCCCC AACATTTGAG I I I AATAG TACCTGGGGC TTGTCTTTCC 
29501 Jc^CAAG TGGGTTTTTC TTAGCCTGAA GAGAAAAACA TACAAAGGTT 
79551 ^AATGTCCCT AAATCATCTG TCAGGTATTA GACTTTCTTC CTTTAGAGAA 

29601 ^^G^ifS ^Sa^ggt atgacctctc cgattcagag ttcaaatctt 

7Qfi^1 rAATTTCTGT ATAGCCTTTT GCTTTGTTTT GCTTTCTGTC TTTCAGAG6A 

29701 ^^GMXC cagccaStc CCCTTGGTGG ccccttggaa gacctaccct 

29751 CTCTTCTTTG GCACAGCGAT TTTTTCATTT GAAGGCATTG GAATGGTAAG 
29801 AGCreCACTG TGATTTGGGC TAGTGTTCTC TGGTGCCCTT GGTGTTCTCC 
29851 AGCT^^^ S^icAATGC TGAGGAAACA TTGTTAGAAA GTATCTTCTG 
29901 AGGCCAGGCA TGGTGGCTCA CGCCTGTAAT CTCAGCACTT IGGGAGGCCT 
29951 AGACTGGTGG ATCACTTGAG GTCAGGAGTT CGAAACCAGC CTGGCCAACA 

^nnm tggtgaaacc ccatctctac taaatataca aaaatcagct aggcatggtg 

?rSA?GCCT ATAATCCCAG CCACTCGAGA GGCTGAGGCA GGAGAATTGC 
30101 ^^^CTgS ^^JgSSg ^TGCAGTGAG CCAAGATCAC 6CCACT6CAC 
30151 TCCAGCCTGG GTGACAGAGC GAGACTCTGT CTCAAAAAAA AAAAAAAAAA 
36201 AAAGAAATTA TCTTCTGTAA CTCACTGGTC AGTTAGTGAA TAGTGTTTCG 
30251 GGGATTCCAT T6AGATTTCC CAGCTTCAAC TTTTCAAGAC AAATTATATG 
30301 TAATTTTAAA ATGTTTACAT TCAAGGCCCC TTCACTGCAC ACTCATCTCC 
lolsl wSct^ ctSSaaTA GCATATGGCA ATCA66AAGG CAGGGTCTAG 
30401 AGTCAGACTG ACATGGGGGT AAGTCCTGGC TCT6CCATAG AGTAGCTCTG 
30451 TGACCTTGAG CAAGGGCTTC ATCTCTTTGA GCCTTCATTA ^GTTCCTG 

hi srji^s v<^^slv^ ^r^i t^^^ 
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30601 ATTTTATTAA TAATGTTATT TTATTATTGA ATCAAJ^AAT GCATGAATAA 
^nfi^l TTTCTCTGCC CTACAACATT GGTTTGGTGT ATTTTCTGCG TbUWAAWU 
M701 S^CCTTCAC TCO^CTCA GCATrCTGTG ATTTCACCAA ATGOTrrCC 

Ei^T IS^^^S^ ^AA^^cS T^^V. 

Si T^T^. '^^^ gg^J 

& ^si^s^ s S iiii 

iiSSi t"J?I^S t^'cS^J" SSI^ S^g^^l aItIJ^ 

i^^^^G Il^S^fc S^C^ g^^C 
31251 AGAAAGTGTG GAAAGATGCC CAGAGACCTT AGGTTCTTGG GCATCCTAAG 
31301 GGACCfTGTG CTAAATTTTT AGTAGCTTTC CCTAACAGCA ^AGCGC^ 
31351 ATTGTTTGCT TGGTTTTATT ACCCAAGACT TOTACACAAA GTTATTCTGC 
31401 AAACATCATT TGTTTTCAAG ATTTCTTTGT ATTTCTATTT TTTTACAgTA 
31451 GAGAGAGAAC ACTGCTAGAT TGACTCTTAG TTTTGGATCT A6GGCTTGTT 
31501 CATTGCATCG GGGTAAAGTG CCAGGCTGCA CACTGTATTC ACCGTGTGCT 
31551 CTOTGTTCAT GCAGCTGTCA CAGGCCAGAT ATGGGCTCCC TGCCCTCTGG 

^^A^s i^ss ^<^i S 3 i i 
I i^^^^i r^^^ --ig FSi 

^^l^^O JSS^^CA^ ^^^s 

3lloi ?StCA«GT CACACCTGCC ATTTTCCCAA rrCCTAAATA AACTCATTTT 
31951 StAGGGGCC ^TfcCnTGC TTTTACCAAA GTTAAAGAAT GTCCTCTCTT 
32001 ATGATGAAGG AGGTAGCAAA GGTCAGTGCT TTGTAATA6G AGTCTGGAM 
32051 CTG6GCATTG TTAAGCACCC ATGTCTTGAG ATCCTCTACG AAGTCATGCT 
57101 TTCTTTCGCA CTGCAGTTTC TCTCACTTGG AGGTTTAACT TGTCACTGCT 
55lO IrSrTTGCC CCTGTTGAGC TAGAGTGGCA GTTTTCCCTG ACCTATATTT 
32201 ^iS??^ ASSclrfA GWGGAAGCT GTGATCTCAG GTTAGATGCC 
32251 AGGTGGGCAT GACATGAGAG GGCOTCTGG TTGCCATGTG GCCTCACTCA 
32301 SIaAGG GACATTCCCA GCCTCCAGGG ATCCTGWGC AGGAGG^CAG 
32351 AGCACTGGCC TGAGCCA6GA GTCCTGGGCT CGTGTCCT6T CCACCCTTAC 

'32«i '^^^'ak CCTCT^^ XXT^EffTA TTTAAAGAGC AGATGTGCTT 
32501 CTGGAGAATT CTGGGGATAA A AGAGTTA CT TTTTTTCTG* GGTTTTTTTT 

ii S S S & 

PS SSSS S3 
SSSSS SSSS 5Sf^ 

xS-rrrrrrl CTGCCAGCCC TCACTGGCTG CCCTGGACTG CATTCTGnT 
i'2951 GGgSIwS ^^A^^icCT ^CTGCTGAAG CCATTGGTGC TGATCAGCCG 

ii^i s^^T ^^iTg^^ Ji^^ss ^^isr^ g^^, 

ii fG^j^^^c 

^^Arrrrrr TTCTACCTCC CTTGCATTGG TGAATGTATT ATAGGGAAAT 
5?5?5?rS? mCMATGC TTCCTGAAAG GGTGAATGTC CCAGGGCATG 
33301 tSg^ ^^5iAaATGA ATCATCTCAT GGTGGAGAGC 

lllVi IrCTCTTAGC AGACACTGAG AAGCTTGTTG AGTGCTCTGC GGATCAGAAT 
33401 ^S?S^G T^^GC^ CTGATCTGCC TGGGTGTGCT TT^TTTTG 
33451 TTTTGTATTG TTTTATTTTA ttgtattttt taagacaaca gcactcagta 

i vi^i^T ssi^ '^^^ i^g 
iH? ssI^^x^I ^.^^tI ji^aS 

iSs^ sj^sss T^^jss^ ^gg^g 

ii S is^^^^ ^^^'^ ---- ---- 
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34001 CTAAGTCCAG ACTTCACAGC TCTCCAGAGG CTTTG GGGCT GCTTTCAGTT 
34051 TAATGATAGA GCCACCA6AT GATTTTTCCC AAGAGTTTTT ATTATCTATT 
34101 CATGGAGCAA GTATGACCTT TTACCAGACT C AGTC TTTAC AAGGTTGTTC 
34151 TCCTGCTTAT AGCATAA6AA CATCTTCTAG ATTTTAAATT CAACCACAGA 
34201 GAAACTCAAG GCACATATAC ACAGTCTGTA TTAGCACATT TAAATAGATT 
34251 TCCGACAAGG GAGGACAAAT GTTTCTTGCT GTT TAACA CA TGAGGGTCTG 
34301 GTTTAAGGTG GAGCTTTGCT TAGGGACAGA GACCTTTCCT TTTAATGACC 
34351 AGGTCAGATC TGTAAGTTGA TCACAGACTG TTTTCCTACT CTGTGCAGTC 
34401 AAGGCACTGG AGTAATAAAA TAGGGATATC CTGTGGTGA6 TTACGTCATT 
34451 TTTGGAAGCT ACACTTGAAG CAGTAGTAGG AAGAGAGCCA TAGTGGTATG 
34501 GAAAGATGGA ATTCTGCTCT GGCCTCTTGG TCCTGCAGTG TCTTCATCTA 
34551 ATTCTAGGGA CACTGACTTG GATGGGACAG ATATAAATAG GCTTGTGACA 
34601 TTTTAATTGC AATTTTGTTT TTATTTTTGA AGGCATGTAC ACCTGTATGC 
34651 CCATGGCAAA GATTGAGATT TTCAAAAGGT ATATA GAGAG CATTAAGCTT 
34701 CCACCCC6CG CCCTCCACTC TAGTTCCCAA TTTTACAATT T CCCATT TCA 
34751 GAGGCAACCA TATTCCCAGT TTCTTTTTTG I i H.I i iGTT TGTTTTGAGA 
34801 TGTTTAGTGT ATGATTGTCA TGTGGGGTGA GTGTGTGTTT TTTCCTCCTC 
34851 M I I IIC I I I TTTAAGACAA ATTGTAGCAC TCTGTAGGTA CTGTATTGCT 
34901 TCATGCTTTT TTCACTTAAA AAAAGTGATA TAAAACTGTC CCCATGATAG 
34951 TGATATGCTA TATCATGTGA TAGAGTGATA TATCATGGGG ATAGTTTCAT 
35001 ATCACACCAT CACACCTAGA GTTCTGCCTC ATACTTTGTT AAAAGCTATA 
35051 CGGGGGCACC ACGATTTACC TATCGAGTTC CCACTGGTTA A CATTTA AAT 
35101 TGTTTTCAGT CTTTCCTTCT TAAATAATGC TGCAGTGAGA TATTTTGAAT 
35151 ATAAGCTTTT GTGTATGTGT GTGAGGATAT C TGTGA GGTA AATTTCTAGA 
35201 CATGAAATTG CTGGGTCCGA AGGACATGTG GGTTTGTATC CTTGATAAGT 
35251 GTCAACAAAT CGCAATGGGA CCATTTTGCA CTCTTGCTGA T6ATGTATAA 
35301 GTGTGCTGAG CAGGCTTGGA ATGTCTCCTG TC TGTTTCG G CAGGTTGTAC 
35351 CAGTCAGTTA AGCTGCTGTA CTCCATC6GG ATCTTTTTCA CCTACGCACT 
35401 CCAGTTCTAC GTCCCGGCTG AGATCATCAT CCCCTTCTTT GTGTCCCGAG 
35451 CGCCCGAGCA CTGTGAGTTA GTGGTG6ACC TGTTTGTGCG CACAGTGCTG 
35501 GTCTGCCTGA CATGTGAGTA GAAGATGATA ATTGCCTTGC TTGTTTTTCC 
35551 CTAAAGGGCA CCCAGTCTGC AGGCTTTCAT GAGAAAAGAC AATGTGTGTT 
35601 GTAGTGAAGC TGGCTATGTT TGTGACAGAG AACCTGGCCC ATGGCCTCAC 
35651 TTTCAGAGTT GAGGCACCTC CAGATGGGGA AGTGAATTAA TTACATATGT 
35701 ACTGTAAAGA ACATGGGAAT GAGGACAGTG GTTTATGTAT AGATAGGGTA 
35751 TGAAATGCTG TGGAGGTGGT TATCATTCAG AGTAAAGACA TGCGATTACT 
^qROI ATrrCATATT AAATAAGGTA AAGGTCTGAA AGCCATTTAA CCCATATCTG 
35851 TAATGAGTAT AAGTTACTCT GATGAAGGGT ACTTATTTGC TTTTTCAAAT 
35901 AGTTG I I I I I CCACTGTGAC AAGTTGCTCC TTAGATTTCC TTTAGAG GCT 
35951 TTATGATAGT ATTCTAGACA M H I lAATG TCAGTCTTAC TAAATATGTT 
36001 TCAGAAAATT TCTATTGATT AACCTAGGTA TTTGATTGAT CACTTGTGTT 
36051 TTATTCTTCT TCTCTCAACC CCATTCCCAG GAGTGTAAGT TAAAAGACAG 
36101 GATACCCTTC TGTTTGCTGT GGTTGAAAAC TGGTGACATT TAGAAAATAA 
36151 AA6TAAATTT TTTTTGTAGC TTCTGTGAGT TGGTAGACTA 6AGAACCCCT 
36201 GA6CAAATCG GTTGATAATA GCTAATTTAA GTTTCTAAGA GATTT GCAAT 
36251 TGTTTTCCAA ATTCAAATGC TTAAAAGCAT AGAT TCCTCT TTTTGGCTCT 
36301 ATTTGGCTTT TTTTTCTCTT TTTAGGTTTT ATT ATTTT TG AACAAGAACC 
36351 TCTTTGCTTA TTATGTTGAG ACTTCCCTGA GAATTTTCTT AAATTATTCA 
36401 GTCTGAGCCT CTGTCTTTGG GATAAAGATA GATCCATATG ACTTTTTAAA 
36451 TTCTAATTAG GGTTGAATGT TTTAAGGATG AAAGATGGGA AAGTTGTCTA 
36501 GCATTTGCTC TTAGTCACTC CTTCA6GCCC TCTCCTAGAC CAGCCTATAT 
36551 AGAAACAGCC CACGCAGCAG CTAATCCAGG GGCCAGGGCT GTTGAAAGCC 
36601 AGCTGCTGTT CCCACAGCGA CTGAAAAAGA A6GAACATGA TGTATCCTGC 
36651 TTTTCTAATA GATTGCCTTA ATGTGTGCT6 CTAAGATGGG ATGCT TGGAC 
36701 TGTAAATTTT AATCCTATCT TGTGCCAGTA ACTCTCCATG CTTTGATTCC 
36751 AAAGTGTATG TTTCCACCGT GGATGGAGTA GCTCTAAGTG CTTGAGGAGA 
36801 CAGCTTTCAC GTGTATGGTA TTTATAATGT AAACTCTGAG GGCCCAATTC 
36851 TTAAATCTAA AGGGCACTGG AAGAAAGA6T GTGGTTAGTT CAAATAATTT 
36901 GCTTTTATCC AAAGTGCTCC CTCCGGAAAA AGTAGGTCTC TGTAGGTAAA 
36951 ATGTGCCTTC CTGACTAAAC AGCTCCTCCA CCCTGCCTAT TGAGCTGGGG 
37001 CAGTGACAGG AGCCTGACTC CTCTCCCTGC CCAATTTTCC CCTCCAGCCT 
37051 GGCTCAGCCT CCCTGTAGCA TATGTCACAC TTCCTGCCAG GTTTATTTCT 
37101 GCAGCACCCT GCAGGAGACA GCAGTCTCTG ATTCACAGAC CTCATGTTAT 
37151 CCTTAGATGC CTCTTGGATT TTGCTTCACT TTTCCTGGCC CTGT CTGTGA 
37201 GTCTCATCTC CCTTCAACAG GACGATGCTC AGAAGACACG GCTGCTTTTG 
37251 GTCTTCAAGT GTGTGCAGTT GTTTTTCCCT TCTGTGATCT 6TTGTGACTT 
37301 AGCATTGCAT TGTCATCCTG TTCAAAAAGG CAGCCCCCTT TATGTCTGAG 
37351 A6CACTCGCC TCTCTCACCT TCCTTGGAGA CTTTGAAGTA ATTGTGG6AC 
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37401 TCAGTAGAGG CCTTTCAT6G CAGCAGCAAC TTAAATGTAT TTATGCGCGT 
37451 TCATTTTGTT CTTGCTTCTC TGTTCTTTCA GATCTT TCAG CACCGTTGGT 
xArT-ATrrrl tttTAGATCT TTAATTGATT TTTTTCATTT ATATTC ATAA 

S V^'^c^ l^^^ S 

ii^i i^ssn ^i^.^. ^^-^ 

l77?l f^^^ISmc amSottgg agaagagggg gagggaaaac aagtatttca 
'7801 tS^IS^g ^^^gt tatgatttca tatcggac^s tatctacttc 

^TRSI TAGCCCATAT mTGGAAAT GCGGACTTAG CAGGTCACCT I^I^TLl^w 

37901 ^SgtSg Iagaggctgg ccccacctgt ggagtctgga gttgtaggat 

?4nci rAiriv-TTTT TTAGATTTCT TTGGAGCAAT AACCCATCCA TCCTTCAGTG 
laool ^cmIctg I?^ctctgtc TCATTTGCCA TGTGAAACAT tttacttcag 
38051 ^^T^A^S ^^I^^AGA AACCTATTTC TGAA6ATATA ATTA^ 
38101 ATCGCATCAT CCAAGAA6CC TGTTCAGACT GGAATGCAGA 6CTGCAAAAL 
38151 ATTCCAAGCA GTCGGATTTT TAGAGGATGA A GCTTCCA GG TiCCAAACWGA 
38201 GTAGCTTCTT AGTACCTTTG GGCCTTTCAC ACTTTTTACT CTTGCAGCTA 
38251 CAGTGAAGAA GAGCAGCATC ATTAATTAGC TGTGTAACCC TGCCACCCCC 

^'S^l 'CT^'CT^CT^ I^G^g '^J^ 

Src^ ^c^^c sss^ 

i?^c T^c?^^S Si 

i ?S T^^J^ ss-^- 

ilfoi WCT^TTG OTGGATCATA TTAACTAAGG TACTCCTTTC ATGTTG6ACA 

ssss ss^ss r^s m S 

^RR'il TTATAGACAT TTAAAAGTAA CATGGTGAAA CCCCGTCTCT ^^^'^^77^7 

^S^S^ SIS^S ^aa^^S^gH gg^a 

^A^^c^^^ S 

39101 xSEfTGOTA AGTTTTAACA GCTTTCATCA TAATAAAATA GCAGCAAGAG 
39I5I CTCCCAGCAC AGGAG^TA AATGGCCAGC GTATTTCGTA AGTrCGCTTT 

II25I T^^^^ ^C^^GG CTC^^ic ^^Ecfr^ TGTGAACATG 
39301 ISSa ™^GTTGCr TCAGCAGTTT AAAAGCTCAT ATTCTTTCTG 
^Q^^l TCTCTTGACT CGAAGGGAAG ATGTTTTGTA ATACTGTTGG AGCCCTtI 1^ 

39401 Ictaatcatg TGGTCGAGCT GAGGTTGTCC tctgtccccc cttttctaca 

iS^s ^^^0 i^^c '^^^fi f^:^ 

i ^G^a^^ ^^^-caTa^ "c^i S 

39651 CATTTAGTGC CAGAAGGGGT TTTATTTGCC CCACATGTCT gcatagtcga 

vB^^ i^^c^ G^tsss s^^jsr^ ^^Ji 

Si^C I^^S^ ^CA^^ 
39901 GCTAGGACAG GAAGAAGGAC TTCCCTTTGC AGCCCTGTGG TCCTGGCTTT 
39951 ^GGAGAG AAATGTrOT AAAATCTCTA TTAAGGATAT TTTTATTAGG 
AnriM rATTTTT^C TTATATAGTG GTGAAAACAA GAACAAGTTT TTAGATTACT 

40051 wSCSw StSStSAg CGGAAGATCT ttgtccaatc agaggaaaaa 

40101 ^<^CCC AATCTTCTGT TTCTGTTTCC ACTTAACTCC CACCACAGAG 

S ^I^^G^S I^^T^S^ ^iSS ^^^^ 

^G^C^SfG^ ^cS^S^g '^^i 

Si iiii s Si :f II s 

40501 tct^actSag gtgaggaaca tatcccagaa cacagtccta agtgactaac 
40551 actggagtgt atagttcctt agaatttcag agttgggcga gacttc^ 

A^\cif\^ AT-rArrrArrr TArrArATTT CACAGGTAAA CGAATGAACT GAbfatH-AW 
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40801 GTCrCAAAGA GTITAGGTAA CTAGCCACT^ AGTGGTA^G CTGA^GTT^G 
40851 AGCCTG6GCA TTCCAAATCC AGAGCCTACA ^ ^g^^. CTTCCACCCC 
40901 TCCCTGGGGC TCAGTTCTCC CGATTGTlA^ aTGACATTAG GCAGGGAGAC 
40951 ACAGTAACAC CTGGCACCAT CACTGCAAG^ ^l^^.-^^^.^ CAGCAGACCA 
41001 CCAGACCCCA GAGA6GGCAA ^GTCrrGGT GTCGGTACCA 
41051 GCTCTTTTGT TGA6CGTTAG ATGACtI Ai,^ caTATTCAAC CCTCAAGGTT 
41101 TCAGCCTACA A-TTTTACCTC CAATTTTCTC ^- TGAAAACATA 

41151 TGGGGGATGT CACCAACTCT ATTTAGAA^^ CTAAACTTTT TGCACACTTC 
5l201 GTAGTrTTGA GTTTCAGGGA AATTAAAAGC CTAAACTTTT ^^^^^ 
41251 TCAAAGCCTT GAGATCG6TG AAGATG^aa aTATAACACT 
41301 GATTAATAGA AGAAGGAAAG ATGAAGAAt-i ^ ^^-^^ tTTGCTGTCA 

41351 GTGGCAGAGG JGGATCTGAG actcgawc ^all^ aagaggtcag 
41401 ggctttggat cccttgtcca attccacti^ Jt^PStggtg catccctgtg 
5l451 ggctgatcgt gtgctgggtt ctccaccatg caccatggt^ aggatccttc 

41501 AAAGCTAGCC TGGGTATCTA CCTCTCATTT ^^^^^ tTCCTTGCCC 

41551 cactgatccg ccagactccc tgtctcccct ^^^^^ GGATCCTGGT 
41601 atcaggtctg tgaggttatg ggccaggggc ttg^^^c^ tctctttcca 

41651 CCCAGCTCTG TTGCTTCCTG JCGTTTACAJL CAGAGGCAGA 
41701 TAGGACTGTA GTAATATTGT GSAGTTACAT 6GTACCACTA 
41751 TTCACCTCCA CTAGTGAGTG CTTAGCA^lfa TGTGTGCTGT 
41801 GACATTCTGC AGTAATGGAA AT6AATGGAA ATGTCL^m. 
41851 CCATTGCAGC AGCCACTAGG CACCAGTGGT TGTTGAGt^l 
41901 GCTAGTGTGA ATGAAGAAG^ GGATTTAATT TCATTT^^ ^ggGTAAGTG 
41951 GTAAATAGCC GTCCGTGGCT ATGTTGGACA ^ ^ AGTCCATTTG 
42001 TGAGGCAGGA GGCAT^T^C CATTCTTTCC "I'^^^j^ aGAAAAGAGG 
42051 CGTTGTTATA AAGAAGCACC TGAGACTGGG TAATTTA ^^^^^^AGC 
42101 TTTATTTTGG CTCATGGCTC TGCAGGCTOT ACA^ GCAGAAGGCA 
42151 ATCTGCrrCT G6TGAGGGCC TCAGGAAGCT aGAAGGGAGG 
42201 AAGGGGGAGC aggctttata tggg^agaca CCAAGTCATT 
42251 TACCAGGCTC TTTTAAACAA CAGCTCTCTC ^ ^ GGCCCCACCT 
42301 CATGAGGGAT TTGCCCCCAC GACTCAAACA ^}^^^^^^ GATCCAAACC 
42351 CTGACATTGG GGATCACATT TCAA^iuwj ^ ^^^y^- jTACTGCAGG 
42401 ATATTACCTG GTAAGTCCTT GTTTCCACAT GTCTC|LA^ aATCAACTTT 
42451 GAGTGCTATT CTCTTTTGrT TGTrrTTATG GCK^^ TAAATGCAAT 
42501 AGACATTTCA GTTTAAAGTG TTT^TJ-^ AGTGATC6AT GTAGACATTG 
Ji7K^^ rrAATCCTTC AGCTGCTCAG CCAAAGAAGC AGT^iv-Jja^ r,CTr,CCTTCC 
i266i GCTGCCTTGG actgagatgt ^J^^^X ^XSX?^C ^GAAGCAA 
42651 TTAGAGTGAC TT^^CTGCAT TTTCGCTTTA ^-^ AGGTAGTAGA 

42701 ACCTCTCATA TAAAATGTAA CCCTCTC6TA "^ ^^^ TGTAGCCCAG 
42751 TAAGCTCTGG ATGTCT6TAT CAAG^GGG AGCA|C ^^AGGTTAGT 
42801 CAGTAGGAAA GACAATCTGT GAAACTATAT ^^^^^ CATGGCCAAA 
42851 AACTAACAGG AAGTCATGCA ^Tt I aw-aw ^gctaAGACA TCCCTACTGT 

i290l aagatgagta ctaatgatga taagjttaac agctaagaca ^^^^^^^ 

42951 ACACCAGGCC TTrTGTGAGG GACCTGCATA A^^ CAACAAGAAA 
43001 CATCTCTATG ATTCAGGAGC AGTTAATATL ^ ^^^^ TCAGCTAATT 
43051 ACTGGGGAAT A6AAA66TAC ^ATAtCi CCTAGTTCCA GAGCCCACAG 
43101 AGCAGCAGAG CCAGGATCTG AAC^CAAG^A ^^^^^^G GTGGAAAGAT 
43151 GCCTCAATAA ACCTGTGAAA CACTGGCCTT ^^J^.^^ GGTGTATTCG 
43201 CGGT6AGATG GGAAGCGTGG GGTCAGTGGG ^AC^^ gACAATGGCT 

43251 GTGAAGCCTC ctcctgctta cagcactgtc tggca^ accattggtc 

43301 ggtatggcac ggaagccgat gg^cctcct tcagatgtga 

43351 ttcgtcagtt cctccttcct g^tcacccg tggggcttcg 

43401 gagccagtgg gtgtcctgtc acagagatal gtatcaaaat 

43451 ccaggggtca gcctgcagat agaactgcti ggcggggttt 

43501 GCTCTGTGAA ATGCGGTTTT ATCACGGTGT CTTTCCAGAA ^^^^^^^ 
43551 CTTTTCCTAT TTGGTTTCTT GTCAGTCAGG ^ GTCCACCTCC 

43601 GCTCCCTGAG TGGTAAGAAA AT6AGCAGCT TTGACCTGAA 
43651 TTTTCTTCTC CCTACCCTCC CTCCTTGGGT A^ GTGCTGTAGC 
43701 ATGTAATTTG GTTCCTTTTG ^ACAGAAA^ caGAAGATCC GGGTTCCGGT 
43751 CGGAAAAGCT 6AAA6CCTGG GGACCGGAGC CA6AAGATLL ^^^^^ 
43801 CCCAGTTCTG CTGACCTTGC AGTGG6AC6C ^AGTl A^^-^- tttCTTAGAG 
43851 GCATCTATTT CACAGAGATT GAACTGGACA ^ aCAGTAGGGA 

43901 CTCTCAATCC TATAAGATGG ACA6ATGCTG ^ GTCCTCTCTC 
43951 GACAGACCTT TCCCA6ATTC TGGGCATCTT TGGTCATCTC 
44001 CCTGCAGGCA TCTTGGCCAT CCTCATCCg. ^^^^ ^ aTCCCACCGC 
44051 CCTGGTGGGC TCCGTGAGCA GCA6CGCCCT CCCTCACCAT 
SSS^^ ^C^S G^^GG CTTCGTGGGC TTTGTGGTGG 
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44201 GGACCTATGA GGCTCTCTAT GAGCTGATCC AGCCAAGCAA TGCTCC^TC 
44251 TTCATCAATT CCACCTGTGC £1;^^^^^^ GTTACCTGTC CTCAGAGCCT 
44301 AGCTGCCTAC CCCTGCCCCA TGTGTCCCCC gTGGGAACCC 
44351 CAGGTATGGT CCAGGCTCTG AGGAAAGTCA 6GGCAGGGGA 
44401 CTCTGCCTGG CACCTGGATA CCCTGG6CC^ ^GTAA^^ gGGCTGCCAT 
44451 GAGGTGGGGT GGCAGACACG CA6AA6TGCT i^^^^^ CCCCTCATCA 
44501 CGCrCACCTG TACCTATTTA C^CCAGA^ ^^^^Sc CTCGCCCAAC 
44551 TGCCTCCTCC TTCCTACCTG CCTCCCCTCT ^ ^ TGTCCCCCAC 

44601 TCATTCTTAC TGC ACAGT TC ACTTTAl i i a ^ TAACT6GGAA 
44651 CTCATGTTTT CACCTTTTAC ^GGGCCAGGC ^1 A^^^ JCCCAAATGT 
44701 CGCCCCCTCT TTATAAAGCT GG6CTTCTTT CTCATCTCTC ^^^^^^^^ 
44751 TGTATACTCA GTATTCTTCC JATTCGAGTC l^^^^^ ATCTGGACTC 
44801 ACCTGGTCAT TTGAAACAGG ^CCCCAA^ui 'f!^^^^^^^ TGGTATTCCT 
44851 TCTGGCTTGC TGTGACCCCT A^^JGC TTCTCTTCCC ^^^^ 
44901 TAGT6TGGGT CACAGTACTG TGTTCTIAIjI AAAATCAGAT 
i4951 ACGAAOTGTT GCCTAAACTG AAAATATTTA TCTTTTATTT 

45001 rnTGrrTTT agactgtctt agatctgggg ctattacgaa ^^^^^^^ 
45051 ttcagtaaac tttgactcaa cttctcctgc I^^^c^ aaggttttaa 
45101 atgtctgcat gggtcctcgg cactcttcgc ^^^^ ^^(-(-CTC 

45151 TCAGGATCGT CTAAAAATGT ACCTCG6T6A gaCTGCAGAA 
45201 CTGTTGACCA GCCTGGTTTC ATACCGAAAA g AGAGGCGTAT 

45251 ATGTATGGGT GCACC6GGCC GAGGGAA^^^ {ffcAGCCTC AGCCCACGCC 
45301 AAAATGGGGC TGT6TGCATG CAGGCCCATG GACGACACCA 
45351 AGGTGAAAG6 ATCA6CAATG CTCTGTTGCC AGAGTrTGAA 
45401 GCTCTATTGC CACCGATGAG ^AGCTGA^^i ^^^^TAC GCGCAGTGCT 
45451 ATTAA6TTAA TAGACTTTAC AGCAGCTGGT CTGACACTAL ^^^^^^^^^ 
45501 CGGTTGTTTA CAATCAGTGG GGAAAAG6GC AGAACCA^^ aCACTAATTA 
45551 CACTGCCTCT GTGGCCTGGA ^^^^ CTGGGTTTCA CAAACAGCCT 
45601 TGAGCCCTGT CTTTCCCCCA GAATGCCTCC GCAAACTTCA 
45651 TGAGGTTGGC CCTCCTCAAG GTCAGCCTTC AGAl^.^ CTCTTCTrTC 
45701 GAGAAGGCAG AGGAAGATAC ATTGCCTTGC TCCCCCTOT 
45751 CTCTTGGTGT GC6AAGTATT TCAGAAGGt.^ CGTGCGTGTG TGTGTGTGTT 
45801 TAGCTGTGTA TTT6TGCACG IGTGTGTGTA ^GTGC^ CCGTCACCAG 
45851 CCTGTGTAAG TAACAGACCA GACTCCTTTT ^.^^^-^^ GTTGAAGGAG 
45901 GCTCTTGCTT CACTGCAGAT ACAGTTCACT ^TGAAA^ CCAAAGCTCA 
d^QSl AGCAGCAAAA ATGTATCAGG GGTTTTGCI I ^i^;^;' ,^^ tctctTCTGT 
46001 TAAGGGCTGT GACCCACCCA ' mGGgAIgGG A^GTGGAC 
46051 TCCAAAGCCA GGAGAGCTGA CTTCCAGGTG AAGGbA ^^TTAGGAAT 
46101 TCTCATTGTA GTGACTCCCA ACCTACCTAA ^AAl^ gATTTTCACC 
46151 AT6CTATCAT TCTl^CTTG TTCTT^lA ^ ^^t- gtGACCAGGA 
46201 CACCCTTTCT GTTCTAT6GT GGACTCTTAA ^AG^^ CAGGCTTAGA 
46251 ATCTAGCCGG GAGTAGCAGA GGCCCTGTCT ^^^^ CAAGTTCGGG 
46301 AGTTACCAAA GTGGGCTCAG AA^GTCAT ^. GGTGCTAAGA 

46351 CTCTGGCAGC CCAGCCGCTA TCTTAGCTGr ^^^^ aCCTGTGTCC 
46401 GTGGTCTCAG XJAGAAGGTA gatgccaaci ^ ^ gctgctagag 
46451 tgcccatgtc ctccttggtc gacgtttctg gttgggacct 

li! S S « SIS? ^ » 
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start: 13181 

Exon: 13181-13323 

intron: 13324-17943 

ixon: 17944-18034 

intron: 18035-20533 

Exon: 20534-20622 
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intron: 

Exon: 

intron: 

Exon: 

Stop: 



32781-35343 
35344-35513 
35514-44007 
44008-44277 
44278 



CHROMOSOME MAP POSITION: 
Chromosome 5 



ALLELIC VARIANTS (SNPS) : 

DNA 

Position Major Minor 



2064 
2119 
2121 
2123 
2125 
2825 
3288 
6172 
6462 
7031 
7671 
8466 
9097 
9108 
10170 
10966 
12987 
13111 
13120 
13822 
14891 
15207 
16162 
16364 
16411 
16636 
16802 
17111 
17276 
17372 
18317 
18342 
21828 
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31344 
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32570 
33220 
33525 
34589 
34832 
35188 
3S614 
37852 
38643 
39198 
39550 
42281 
42321 
42563 
42675 
42908 
43358 
43371 
44796 
45820 

context : 

DNA 

GACCTCG<X:CTCTGGC6TG6GCGGTGG©VTCACOTGATGA6GTCCG^ 
CCAGAGCCGGAGCCGGAGCTGGGGCCAGAACCCGAGCAGTGAGTTCC^ 

I^G^SS^S^G^S'a^^c^^^^SS^c^fc^Si 



T 


G 


intron 


T 


C 


Intron 


T 


G 


intron 


A 


G 


intron 


G 


T 


intron 


A 


G 


intron 


G 


C 


Intron 


C 


A 


intron 


G 


A 


Intron 


G 


T 


intron 


T 


G 


intron 


A 


G 


intron 


G 


A 


intron 


G 


C 


intron 


G 


A 


intron 


G 


A 


intron 




G 


intron 


G 


C 


intron 


G 


A 


Beyond ORF* 


A 


G 


Beyond ORFi 



2119 



2121 



2123 



CGAGTTCCGGCTGGCGGCGCTCGCCGCCrrGGGCAGGACCCACCTCGC 

^^STGCTcJiGCT^GGCACTGGATCC^^^^ 

GCGTCCCCGGGCCGCAGCTGCGGTACGACGCTGACACCC^^ 

TGGAGATCCCTTGTCCCTCGCGCTATCrCCCTTGACCTOTGGGGT^^ 

CCTGTTTGACTGACAGGTGGGGGAAACTGGGGTAGATGCT^ 

AGTCCCCAGCGCCCAGGAACATAGTCCTTCCAGCAGTGGCAOTMT^^ 

Ig-^ccggctggcggcgctcgccgccttgggcaggaccc^ 

agccagagccggagccggagctggggccagaacccgag<^gtgagttcctccact 
ttcSgctggcggcgctcgccgccttgggcaggacccacctcgccttcctcccggcot^^ 
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CAGATGCTCCAGGTCAGG(^CTGGATCCGCCCGGGCTCTGGCTra^ 
CCCCGGGCCGCAGCTGCGGTACGACGCTGACACCCCTCTCTGM 
GATCCCTTGTCCCTCGCGCrATCrCCCTTGACCTCGTGGGGTTGGG^^ 
TTTGACTGACAGGTGGGGGAAACTGGGGTAGATGGTGAAGATAACCCAAAGGACCATCTA 

2125 CCCAGCGCCCAGGAACATAGTCCTTCCAGCAGTGGO^GTAATAGCTCG^ 

GTGGAG(^GAGCTCCGGAGCTCAGTGAGAAAAAAGGCGCGGCCGCTCAAGG^^ 
ACCTCGGCCTCTGGCGTGGGCGGTGGGATCACGTGATGAGGTCCG^ 
CAGCAAAGGAGGATGGCGAGGGGCTGATACTGAACCCGGGAAGGGTG^^ 
CCAGAGCCGGAGCCGGAGCTGGGGCCAGAACCCGAGCAGTGAGTTCCTCCACTGACGAGT 

Sggctggcggcgctcgccgccttgggcaggacccacctcg^ 

GATGCTCCAGGTCAGGCACTGGATCCGCCCGGGCTGTGGGTCCGCGACTCOT 

CCGGGCCG<>GCrGCGGTACGACGCTGACACCCCTCrGTGAATTG6GCGA^^ 

TCCOTOTCCCTCGCGCTATCTCCCTTGACCTCGTGGGGT^ 

TGACTGACAGGTGGGGGAAACTGGGGTAGATGGTGAAGATAACCCAAAGGACCATCTAGG 
2825 AGAACTGCAAGGACGCTGGGAAGTCGTCTGGTGCAGCTCCCTCCTAGGAC^ 

actSagcccttactccgggaaggggtaagggcttgco^^ 

GGAGACCCGGAGACCTGCGACTAGAATGCAAATGTTCCTAAGCITCAGCA^ 
TTTCGCCACACCGCCTCCTGCGGGAMCrr^^ 

TTTCTCTTTTAGTCCTCTCCCTTTTTAGCTGTCTGCATTTTCCACCGCrGGGGTTG 

GCTCTGGGTGTGGTTCCCTGTTTGTTCArrATTrrrCTGCAAACTC^^^ 

TTGGTTTCTAACCTTCCTGCATTCTATGTAAGTCACACCAAAAT^^ 

AATGTGCTTCTGGGAAGATAG6TGGCTGAGCCGA6GTT 

TGAAGAATGTAAAGACCTTTGCrrTATTTTTTCTGTAACTTGTCAGAm^ 

ATTTGGATGGACGTTTTGCAGTTATrrGAATTrTGCTGAAGATAGCATCATGGTGCAATG 

3288 AGAGCCCTGACGTTAACrTGAAGAATGTAAAGACCrrTGCTTTAl I i ' iJ^rGTAACTTG 

TCAGATrTGGGATTGCTTATTTGGATGGACGrmGCAGTTArrTGAAr^ 
TAGCATCATGGTGCAATGGACAGAACAGAGATTGGGGAATCAGGATAT^ 
CTGCCGCTTACCTGGCAACCTTAAGTGACTCGCGTTTGGGTTTCTCAGTCTAGACA^ 

T6G3;TTGAATTCTTAAGGGCCCCTrCTGCT^ 

TT GM ll ll l bl l lGl ll b l ll l l A AATAGAGATGAGGTCTCACTATGCTGCCCAGGCTG 

CAGGGGTGTGAGCCAGTGCCCCTGACCAGGGTCTG I I I G IM I » ' "ATTrcGAGAGATTT 

TACCCGCTGTGTACACTGAGTATCAGCCrrGCAAC^GACTTMTC^^ 

GCAGTTTCCrCTGCnTATTCCrCTGTTGCTATAAAATCCTCCTCCTCTTTCTTCCT^^ 

6172 GCrTCAOVATCCAGAGTTTTAAATGGAGCCCATACTG(^GACT^ 
TCACTTTCTGTCCTTGAACTTCTCTGTAGTAATAATCACAATO 
TTTATTATGTTCCAGGCAACATATCTAACATTTATTTATTTTTTCCTCCTATCTTC^^ 
SI^I^^SOTAd^™TTAATGACATCTTT<>GATGAGGA^ 
GAGATGAATTAACTTGCTCAGAGTCACACTACCAGACTGCAAA^^ 

TGCAQWKTACCCGCTGCAGACCTAATCCTGCCCCAGGCTCTGGGGCCAG^ 

^SdAGATmAAGGAGGGTATATAATTTAAGGTGTGGTA^ 

GCTGGTTTGCTGGACTGTCATCTCAAATCmGATTTGACTCATCCTGGGGC^ 

gtSgcctttgtgtgtgggccctggttttctgacccctaaga^ 

CTTa-CTAAAGCTATACCTGGCCCTAACATTTAGTGATCTTCATGGTTGGGAGTAAAAGT 



6462 



7031 



GCTTTTTATTGTGCAGAATACCCGCTGCAGACCrAATCCrGCCCCAGGCTCTGGGGCCA^ 

CrrrGTTCGCAGGGAGATTTTAAGGAGGGTATATAATTTAA^ 

TGGACTGGGATGCTGGTTTGCTGGACTGTCATCTCAAATr^ 

GGOTGGATGAGTCAGCCTTTGTGTGTGGGCCCTGGTTTTCT^CC^^ 

TGGAGCrrGACCTTCTCTAAAGCrATACCTGGCCCTAACATTTAGTGATaTCATGG^^ 

GACTLw^GTGTGCGTGTTTGCCTGTTCAGCAGCTGCrrTGTGCAGA^ 
AGCAGCTGCCCTGTAGCTGTTCTAGCATCAGACTCCTACAGGAAAAACT 
GAATGTTCTGCTCTGGTAAGTTGGATGGAATTCTATCTGATGCTGTTTTAAA^ 
ATCTA6AAGCCAAACCATTTTACTTCCCTCACTGTAGACCACACATAGCAA 

TGTCTTTCTTCAT^^ 

TCGACAGAGAGGAGAAAATACATCTGGGGAATTTGCCGCTGCTT^ 

FIGURE 3Q 
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IIZ^SSaacmSacaaaaaaacataggctgggcatgotg^^ 

GTrrGACACCAGCCrGGCCMWGATGAMCCCCCTCTCTACTA^ 



TCAAGAGTTTGACACCAGCCTGGCCMCATGATG^^^ 
AAAAATTA 

^I^?SI??SSS^fSAMT^GGTGGAATCAAGTT6GGCCCAGAAATCT^ 

^^^^ 

?^SS^AS?^^^^I^T2ST2?5nGCACAAAGTGAT^^ 

IcTGCCCATCTAAAAATGGCATATTrrGTAGATTGA^ 

AGCTASACAATAGTTACATAAGGAAAAAAAAGGAATGTmAA^ 

{•T in I I I I n I I I I 1 1 I I 1 1 1 I GGCCATCAAACTTGCAGACI I 1 1 I I I ACTCAGTTGCT 
^CTmCT^CTCTAAiwCTXXTGGAGAmGGAC^ 

GCCTTGGTTTCOVAAAAGAGCCATAGAAAGAACTCCGGGGAOTG^ 
C^CCTGCCT^COT^S;22SS;0C^M^ 

■ TiT i 1 1 H 1 1 1 1 1 r G AGACGGAGTCTCGCTCTGTCGCCCAGGCTGGAOTGCAOTGGC^ 

CAAAAAGAGCCATAGAAAGAACTCCGGGGAGTGGCTCTGCCCACT 
IS?I?5iS^S^^??SMTMCAAGTCT^^ II I I ill I I ill 1 II 



CTGACcrm 

TTTTGAGACGGAGTCTCGCTi 



'"""^ — CTGTCGCCCAGGCrGGAGTGCAGTGGCGCGATCTCGGCTC 



I5^^ct?HgcctScgg^^ 

5^Si^ili:/-^;=?I=r/TrxArrArr,CCCGGCTAATTTTTTGT 
CAAGTTGCT 
ACCCCTCTT 

CTGCAGTGT 



TTTAAAGACAGI I M M 
CATTCCTCATTTAGTT< 

TATACACAACATAAAGAAGTCTCTCTGCAGTGTTTGAGATAAATTGMC^^^ 
[A.G] 

TTCTTAAAG( 
CATCAGTTTi 



ACAAAATGCAATTmMCAGmAOTGAACTGlTTAC^ 

^AGC 
^AAT 
TCOS 

TGCrCTCACATTT^^CTi^^CTia^CCCGT^Tfr;;AXG^^^ 



^^^^lAAAGGCAAAAGCGAA^GGAGAGG^^^ 



FIGURE 3R 
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14891 



T(»OTOTAGCACCACGTATACCAATGGGA^ 



iSi^^CCCTTCCTGCCrrGATGTTGTGATACAGTGTAGGTGACCAGGGAAGCCrATCTGT 



FIGURE 3S 
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TGTCGGAAOTGTTAGAGACAACrmTrGAGCT^^ 



TTAG 



CACTGCAACCTCCATCTCCTGGGCTC 

^GACTGAAACTGGCTCTCTAAAGGTGA 
^^CTTTGTGGGCTTCCCAAGACCATn 
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^^S^S^^S^S^^^^^^™AAATACTAAATGTATCAGGGACrTCTGGCCr 
^ScTTTTTTTTTTTm 

i^'J^pi'^lcCTCTGTTGCAGTTTTGGGAT^ I I I I 1 1 1 I I 1 1 IGAGATGG 

S^SctctIttgStaggctggagtac^ 

CTGG^^^^^S^S^^ATGCTGTTCACTTTGTATCmCr^ 
S?^^^?ai^S^^^S^^lG^^ATCCTCrGTTGCAGmTGGGATCTGAGTCTTTT 

TnTTTTTGAGATGGAGTCTCCCTCTATTGCCTAGGCTGGAGTACAOT^ 
Id^ACTGCAACCTCTGCCTCCCGGGTTCAAGC^ 

gctSgS^t^^ggcacatgccgctatgcctggcta^ 

^Sw^^^SSSSSaAXcSGAGirGGAGCACCCCTCrGGTCTAATC 



18317 



AAAAATGCAGGCATCGTGGTAAG6GTCTG(^TCAGTGGAGAGGAGTGOTGAC^ 
r?Af?TAr7ii II I GI LG I n^lT AAAATGTACTTGCTTTAAAACATnTAAATAGAGAAG 
«^?S^^SIIIIfSS?SxSAAicGGAATT«^ 
"In^AGAGTGTTCTOTGCGTTAGGCACTGnCTAAGCTCTT^ 
TAAAT^CTGCCCTCATGGAGCTTACTTCATGGTGGAGAGGATGTACTGAGATGGCTC 



[T,G] 



rATTTASTCi^GTrGACTGCCTGCCCAAA 

?^S?S?^SSgStctac^Sagatggctcgagcagt^ 

TAATLGTTAGTTACAGATGTCTGCCCAni!F^A«CTCTCCCATGCCCTG^^ 

^^CCASGSAGAATC^TATGTmCTlTT^GTGATT^^ 

li^S^GS^foTTCGGAGTTGAAGAGAGACTCAGA^ 

FIGURE 3U 
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22822 



22683 Crrm/^GrTTTTTG^^GATTGA^^^ 

TTCCCCTTAMCGTCTGAGACGTTmCTCTA^ 

I^c^Ltgcagcattccagacgagmgcc^^ 

CACCTCCCTCAC<^CTTCCCAATGCCCCA6A 
SG^GCGAGAATGGCAAAAGATGATTGAAGTTTTTGTTTAGGATTTTTTCCAAATCAGCr 

FIGURE 3V 
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TrrGTCAACAAAA(^GTTAMGTrrrCATATrTTACATAGATCTAC^ 
TSS^^S^GCTCGGGCATAGAGAAAa^ 

TTTAACAAAACAGCACTTOTACTttOAMTCMTTA^ 

IIISSSS^STg^SSS^-tS^^ 



_ CCTAGGTACATATGA 
TGAAGTTCTCTAAAGATGGC 



CT,A] 

ATCAG( ^ 

CCACCrCCAGACGTOTTGTGGAmCTTCCTGATTCTCACCWGCTGG^ 

TCTATTTTGTGTTTCTGr 
AAAAAAAAAAAACCAGA< 
TTCCAAATCAGCTTTTG 
TCTTCTATTTGATTCCC 

feScCTGrCCTAGGTACATATGA^^^ 



CACCTGAGCqTTCCTCA6Qft«^ 

CACCAATAACTGcl^SSS^SScSmTGACGCCTACCATGGACTCGCGACr 



AGAGGTGTTTCCGTTTTC 



S^SS^^G^S^^ISi^^S'c^cI^GTCTTGrCGACCCTC^^ 

Sa^^acatatgatcaaacctagocagacaattg^ 



IIIS^^^?^SS^i?^^^^SS^^^^iS^ScAGGCAAGTCTGTGGGGT 

^^^Lgagagaacatctgtatacj^tg^^ 

^i^aS^SSCTCTS^S^CTGTATGTGATTGGTCTGTCTCA 

FIGURE 3W 
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^^s^^c?^^^^?CT^^^^^stss^^A^s^s^I^<^cTAGAT^^^ 

CACAAATACCTOTGAATACCATGTCCCCACCCt^^ 

^^T^LTCTACCTTGCmTrTCCACCT^ 
ATTCCTGGCTCTACCTTGTT^CTTGTTAGTTOT^ 
TGCCATTTATTTMCAAOTCCTWCTCAGTC^ 

26737 CTATCCATCCAGAGTGATCTCTATGCATACTACAAC^ 
AAGGGCAGCCCTGCTGAmmGAGAGCMTrC^ 
AGTTGTGCTGTGAT(^TAOTGCCOTCTGCCreGAG^ 



ATAAATGTTTTATCACTGCATATCGTrGCTTCTGAGATGTATrnrn 

FIGURE 3X 
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27423 



27735 



29875 



30356 



ATGGTAAGACCTCACTTAATATC^ 
ATGTATAGCGAAATCAGTTTTT^ 
TGACTGTACGTCATTTCAOTAMCTCTCA 

^^K^^^^^aST^ggt^^^^^ 
^^taLataatacacttcttctgctg^^ 

TGAMTTCCCAGTCTGATAOT 



31344 




32570 



TGMG<XTGTTATAGAGOTMTAAT6GATCTCCTm 
GCCTGTTMGAGCATCCaCOTATTATCCA^ 



CAACATGGTGAAACCCCATCTCTACTAMWACAA^^ 

^^G^^^rcIi^^G^^C^^C^'cS^'^^^ 
CGCCTATAATCC^GCCACTCGAGAGG^^^^ 

GGAGGlTGCAGTGAGCCAAGATCACGCWCTGWtllXAW.^^^^^^^ 

TCTGTCrCAAAAAAAAAAAAAAAA^ 
GTGAATAGTCmCGGGGATTC^TTGAGATT^^ 



ATATGTAATrTTAAAATGTTTAi 
[G,T] 




CTGATCACATGTAMT^CTGCTOracrGAG^ 
TAGGGCrmCATTGCCCATGAAGAOTGTCTCAC^ 
CTGCTGCTGGGGCAGAGGCCTGAGT^GCCWTACTT^^ 

^^a^ATTGmGmG^ITACCCMGAC^ 

ATCATrrGTTTTCAAGAmCTTTOTATTT^ 

CTAGATTGACTCTTAGTTTTCGATCTAGGGCTTG^ 

t^SS^^^'c^J^G^JJ^^miS^^^S^ACAGATACATTTAG 




^^^Jca^AfTTTTAGGACTTCCTAGCTAGG^^ 

tllGL^^^^CTCTCCTCTCTCACTACTCTCTCATAGGTTCTGCCCCTGGAAAA 



FIGURE 3Y 
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A^^^TCSIdo^CTCACTOTC^GCCGGGGATrGTTWKWOTCC^ 
mrLTTGGTGAATGTATTATAGGCWWTAGTCyVGCCATTTTGAAATGCTTCCTG^ 



rrrrrArArrA^S^STAGwSSACTG^^ 

^^^"^Si^CT^i^SGGTGTGCTTTTTAT^^ 
T^TTATTTTATTGTATTTTTTAAGACAAWGCACTOVGTATrrCC^ 

™f^^S?fGGCTCATCT6CCTGG^ 

™^S??J??SSS[S^^SSEtcagtatttcca«k5gct^^^ 



r^iSATAAAM-SGGM^TCO^ 
]SOTOTilmTlSTCCTmTTTTCrrrTTTAAGA^ 

Stotcaaawttgagattttcaaaaggtatatagagagcattm 
aTatc?^I^^J^^S^I?I"gtJa?Sa?^ 

^s^^^i^G^^*cSI^l^^^t^^^IXS^^^ 

TGCTGCACTGAGATATTTTGAATATAAGCTTTTGTGT^^ 

iTOCOTmfOTOTaciAScGCCCGAGCACTGTGAGlTACTGGTGGACCTGm 
i^^?mS^S^^S^cScC^S^^STGA6TTAGTGGTGGACC^GmGTGa^^ 

FIGURE 3Z 
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^^^^ 

MTTCTGGTG 
GAACAGGAGG 
ACCACrCTCC 



37852 irn--^-SSrcSI^S^cIIJf^^ 



TCCTCCTCCTCTTCACCCTLU^ 

"TC I U\i.^^^ I V- . ^v- (^GAGGGGGAGGGAAAACA 

rCGGACAGTATCTACTTCC 

38643 C^CCTCAACTGTMGATAGGA^GTGTCT^ 



GATGTTTTGTAATA< 



^^^^ 

FIGURE 3AA 
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42321 CATTCTTTCCCrGCTGTGTTAGTCO^TrrGCGTTGTTATAAAGAAGCACCTGAGACTGGG 
TAATTTATAAAGAAAAGAGGTTTATTTTGGCTCATGGCTCT6CAGGCTGTACAGGAAGTG 
TGATGCCAGCATCTGCTTCTGGTGAGGGCCTCAGGAAGCrrCTAATCATGGCAGAAGGCA 
AAGGGGGAGCAGGCTTTATATGGCAAGACAGGGAGCAAGGAGAAGGGAGGTACCAGGCTC 
mTAAACAACAGCTCTCTCAGGGAGGGCCCCAAGTCATTCATGAGGGATTTGCCCCCAC 
[G.A] 

ACT(j\AACACrrCCCACCAGGCCCCACCTCTGACATTGGGGATCACATTTCAACATGAAA 
TTrGGAGGGGATCOKAACCATATTACCTGG TAAGTC CTTGTTTCCACATGTCTCT CATC T 
TACTGCAGGGAGTGCTATTCTCI I I Idl I IGI I M I ATGGCTCCTCAAAAATCAACTTTA 
GACATrrCAGTrTAAAGTGTTTCrrAAAAATCTGGTCTCTAAATGCAATCCAATCCTTCA 
GCTGCTCAGCCAAAGAAGCAGTGATC6ATGTAGACATTG6CTGCCTTGGACTGAGATGTT 

42563 TTAAACAACAGCTCTCTCAGGGAGGGCCCCAAGTCATTCATGAGGGATTTGCCCCCACGA 
CrCAAACACTTCCCACCAGGCCCCACCTCTGACATTGG GGAT CACATTTCAACATGAAAT 
TTGGAGGGGATCCAAACCATATTACCTGGTAAGTCCI IGI I I CCACATGTCTCT CATCT T 
ACTGCAGGGAGTGCTATTCTCI II Idll IGI I I I I ATGGCTCCTCAAAAATCAACTTTAG 
ACATTTCAGTTTAAA6TGTTTCTTAAAAATCTGGTCTCTAAATGCAATCCAATCCTTCAG 
CG,C] 

TGCTCAGCCAAAGAAGCAGTGATCGATGTAGACATTGGCTGCCTTG GACT GA GATGT TCT 
GGCAGTCTCACCAGTGTGGTGCCTTCCTTAGAGTGACTTGACTGCATTTTCGCTTTACAG 
AATGAACTTAGAAGCAAACCrCTCATATAAAATGTAACCCTCTCGTAGGAATCAATGAGG 
TAGTAGATAAGCTCTGGATGTCTGTATCAAGGCTGGGAGCATCCAGCTGTAGCCCAGCAG 
TAGGAAAGACAATCTGTCAAACTATATTTGATTGCTAACAGGTTAGTAACTAACAGGAAG 

4267 5 CATGAAATTTGGAGGGGATCCAAACCATATTACCTGGTAAGTCCTTGTTTCCACATGTCT 
CTCATCTTACTGCAGGGAGTGCTATTCT CI I I IGI I I G I I I I I ATGGCTCCTCAAAAATC 
AACTTTAGACATTTCAGTTTAAAGTG I MCI I AAAAATCTGGTCTCTAAATGCAATCCAA 
TCCTTCAGCTGCTCAGCCAAAGAAGCAGTGATCGATGTAGACATTGGCTGCCTTG GACT G 
AGATGTTCTGGCAGTCTCACCAGTGTGGTGCCrrCCTTAGAGTGACTTGACTGCATTTTC 
[G,A] 

CrTTACAGAATGAACTTAGAAGCAAACCTCTCATATAAAATGTAACCCTCTCGTAGGAAT 
CAATGAGGTAGTAGATAAGCTCTGGATGTCTGTATCAAGGCTGGGAGCATCCAGCTGTAG 
CCCAGCAGTAGGAAAGACAATCTGTCAAACTATATTTGATTGCTAACAGGTTAGTAACTA 
ACAGGAAGTCATGCACTGTAGCAGGATGTACTTTTCATGGCCAA AAAGAT GAGTACTAAT 
GATGATAAO^TTAACAGGTAAGAOVTCCCTACTGTACACCAGGCCTTTTGTGAGGCACCT 

42908 TGGACTGAGATGTTCTGGCAGTCTCACCAGTGTGGTGCCTTCCTTAGAGTGACTTGACTG 
CATTrrCGCTTTACAGAATGAACTTAGAAGCAAACCTCTCATATAAAATGTAACCCTCTC 
GTAGGAATCAATGAGGTAGTAGATAAGCTCTGGATGTCTGTATCAAGGCTGGGAGCATCC 
AGCTGTAGCCCAGCAGTAGGAAAGACAATCTGTCAAACTATATTTGATTGCTAACAGGTT 
AGTAACTAACAGGAAGTCATGCACTGTAGCAGGATGTACTTTTCATGGCCAAAAAGATGA 
[G,A] 

TACTAATGATGATAACATTAACAGGTAAGACATCCCTACTGTACACCAGGCC I M I GTGA 
GGCACCTGCATAACCTCATTTGACCATCATGACATCTCTATGATTCAGGAGCAGTTAATA 
TCCCCATTTTGCCAACAAGAAAACTGGGGAATAGAAAGGTACCATACCTTCCCCAATGTC 
ACTCAGCTAATTAGCAGCAGAGCCAGGATCTGAACACAAGAACCTAGTTCCAGAGCCCAC 
AGGCCrCAATAAACCTGTGAAACACTGGCCTTTGCCCACCTGGTGGAAAGATCGGTGAGA 

43358 AATAGAAAGGTACCATACCTTCCCCAATGTCACTCAGCTAATTAGCAGCAGAGCCAGGAT 
CTGAACACAAGAACCTAGTTCCAGAGCCCACAGGCCTCAATAAACCTGTGAAACACTGGC 
CTTTGCCCACCTGGTGGAAAGATCGGTGAGATGGGAAGCGTGGGGTCAGTGGGCACTAGG 
ATGGGTGTATTCGGTGAAGCCTCCTCCTGCTTACAGCACTGTCTGGCAGTGTTGACAATG 
GCTGGTATGGCACGGAAGCCGATGGCACCTCCTGCGGCAGTGCACCATTGGTCTTCGTCA 
C-.G] 

TTCCTCCTTCCTGGCTCACCCGTGGCTGAGTTTCAGATGTGAGAGCCAGTGGGTGTCCTG 
TCACAGAGATACGGTCGTCGTGTGGGGCTTCGCCAGGGGTCAGCCTGCAGATAGAACTGC 
I I I I I I I CACCTGTATCAAAATGCTCTGTGAAATGCGGTTTTATCACGGTGTCTTTCCAG 
AAGGCGGGG I I I C I I I I CCTATTTGG I I I C I I GTCAGTCAGGTAGAGATGTT TGTGT TGG 
AGGCTCCCTGAGTGGTAAGAAAATGAGCAGCTGCTCAGGAACGTCCACCTCCI I I ICI IC 

43371 CATACCTTCCCCAATGTCACrCAGCTAATTAGCAGCAGAGCCAGGAT CTGAA CACAAGAA 
CCTAGTTCCAGAGCCCACAGGCCTCAATAAACCTGTGAAACACTGGCCTTTGCCCACCTG 
GTGGAAAGATCGGTGAGATGGGAAGCGTGGGGTCAGTGGGCACTAGGATGGGTGTATTCG 
GTGAAGCCTCCTCCTGCTTACAGCACTGTCTGGCAGTGTTGACAATGGCTGGTATGGCAC 
GGAAGCCGATGGCACCTCCTGCGGCAGTGCACCATTGGTCTTCGTCA6TTCCTCCTTCCT 
[G.C] 

GCTCACCCGTGGCTGAGTTTCAGATGTGAGAGCCAGTGGGTGTCCrGTCACAGAGATACG 
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GTCGTCGTGTGGGGCTTCGCCAGGGGTCAGCCTGCAGATAGAACTGCI I II I I ICACCTG 
TATCA AAATGCTCTGTGAAATGCG GI I I I ATCACGGTGTCTTTCCAGAAGGCGGGGTTTC 
TTTTCCTATTTGG I MCI I GTCAGTCAGGTAGAGATGTTTGTGTTGGAGGCTCCCTGAGT 
GGTAAGAAAATGAGCAGCTGCTCAGGAACGTCCACCTCCI H ICI I CTCCCTACCCTCCC 

44796 GCCATCGCTCACCrGTACCTATTTACACCCAGAACTTTCCAGCTCCCCCTCATCATGCCT 
CCrCCrrCCTACCrGCCTCCCCTCTGCTGGTGCACCrCGCCCAACTCATTCTTACTGCAC 
AGTTCACTTTATTTAACAATTTTCATGTCCCCCACCTCATGTTTTCACC! I I lACTGGGC 
CAGGCATAGATTAAGTAACTGGGAACGCCCCCTCTTTATAAAGCTGGG C II C I 1 I CTCAT 
CTCTCTCCCAAATGTTGTATACTCAGTATTCTTCCTATTCGAGTCTCCAGGGGGTGGCTG 
[G.A] 

ACCTACCTGGTCATTTGAAACAGGCCCCCAAGCTGGAG I I II I AATCTGGACTCTCTGGC 
TTGCTGTGACCCCTAAGGCAATGCTTCTCTTCCCTGGTATTCCTTAGTGTGGGTCACAGT 
ACTGT GTTCIT AGTTGCTTTAGCTCTTAAAACATACGAAGTGTTGCCTAAACTGAAAATA 
TTTATCI I I lATTTAAAATCAGAI I I I IGI I I I I AGACTGTCTTAGATCTGGGGCTATTA 
CGAATCAC I I C I I C I I CAGTAAACTTTGACTCAACTTCTCCTGCTGAAAAGAAGCTCGCT 

45820 GGGAAAAGGGCAGAACCAGTGCCCGGCCCCACACTGCCTCTGTGGCCTGGACTTTGAAAG 
GAACCCACTGAACACTAATTATGAGCCCTGTCTTTCCCCCAGAATGCCTCCCTGGGTTTC 
ACAAACAGCCTTGAGGTTGGCCCTCCTCAAGGTCAGCCTTCAGATTTGGGAGCAAACTTC 
AGAGAAGGCAGAGGAAGATACATTGCCTTGCTGTGGGCTGCCT C I I C I 1 ICCTCTTGGTG 
TGCGAAGTATTTCAGAAGGCCATTGATGAATTCCCCCTCTTTAGCTGTGTATTTGTGCAC 
[A.G] 

TGTGTGTGTACGTGCGTGTGTGTGTGTGTTCCTGTGTAAGTAACAGACCAGACTCC I I I I 
CTCTTCTGTCCCGTCACCAGGCTCTTGCTTCACTGCAGATACAGTTCACTCTGAAAGCTG 
GTTGAAGGAGAGCAGCAAAAATGTATCAGGGGI I I I GCTTCTGTGTTTCGCCAAAGCTCA 
TAA66GCTGTGACCCACCCATAT6GCCCCAGI I I I I I CTGTCTCTTCTGTTCCAAAGCCA 
GGAGAGCTGACTTCCAGGTGAAGGGATGGGAAAAGTGGACTCTCATTGTAGTGACTCCCA 



FIGURE 3CC 



